Questions about unified action space and Benchmrk action categories ？

Thanks for your excellent work.

I found that the performance issues on GUI-Odyeesy and Android-Control were not reproducible. 
- https://github.com/inclusionAI/UI-Venus/issues/9
- https://github.com/inclusionAI/UI-Venus/issues/11
- https://github.com/inclusionAI/UI-Venus/issues/18

During training and evaluation, did you customize prompts and action spaces for each dataset ? For example, GUI-Odyeesy does not have `Launch(app=app_name)`, but Android-Control has a similar action?

By using the same action space as the benchmark, the model reduces the number of unexpected actions predicted, and performance improves.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Questions about unified action space and Benchmrk action categories ？ #22

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions about unified action space and Benchmrk action categories ？ #22

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions