Skip to content

Rough API Usage Estimates for evaluation on FutureX #59

@jmnian

Description

@jmnian

Hi MiroFlow team, thank you for open sourcing MiroFlow and it's a great repo! I'm reading your code and trying to run the FutureX benchmark and am wondering the rough API costs. For example if I were to run the ./scripts/run_evaluate_multiple_runs_futurex.sh for majority voting over 3 runs, what would the rough cost be for OpenRouter, search API, and others combined (assuming I use agent_quickstart_1.yaml and I add tool-searching, o3_hint: true, and o3_final_answer: true)?

Or could you please share how much money did it cost you to run the submission you gave to FutureX? The "GPT-5 (MiroFlow-preview DR)" currently on FutureX's leaderboard.

Any insights are highly appreciated as I am paying out of pocket for the API credits.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions