Rough API Usage Estimates for evaluation on FutureX

Hi MiroFlow team, thank you for open-sourcing MiroFlow; it's a great repo! I'm reading your code and trying to run the FutureX benchmark, and I'm wondering what the rough API costs would be. For example, if I were to run `./scripts/run_evaluate_multiple_runs_futurex.sh` for majority voting over 3 runs, what would the rough combined cost be for OpenRouter, the search API, and other services (assuming I use `agent_quickstart_1.yaml` with tool-searching added and with `o3_hint: true` and `o3_final_answer: true` set)?

Alternatively, could you please share how much it cost you to run your FutureX submission, the "GPT-5 (MiroFlow-preview DR)" entry currently on the FutureX leaderboard?

Any insights are highly appreciated as I am paying out of pocket for the API credits. 

Rough API Usage Estimates for evaluation on FutureX #59
