.github/workflows/nightly-terminal-bench.yml

Nightly Terminal-Bench #2

Workflow file for this run

.github/workflows/nightly-terminal-bench.yml at cd58afa

	name: Nightly Terminal-Bench

	on:
	schedule:
	# Run full benchmark suite (~80 tasks) every night at midnight UTC
	- cron: '0 0 * * *'
	workflow_dispatch:
	inputs:
	models:
	description: 'Models to test (comma-separated, or "all" for both)'
	required: false
	default: 'all'
	type: string

	jobs:
	determine-models:
	name: Determine models to test
	runs-on: ubuntu-latest
	outputs:
	models: ${{ steps.set-models.outputs.models }}
	steps:
	- name: Set models matrix
	id: set-models
	run: \|
	if [ "${{ inputs.models }}" = "all" ] \|\| [ -z "${{ inputs.models }}" ]; then
	echo 'models=["anthropic:claude-sonnet-4-5","openai:gpt-5-codex"]' >> $GITHUB_OUTPUT
	else
	# Convert comma-separated to JSON array
	models="${{ inputs.models }}"
	models_json=$(echo "$models" \| jq -R -s -c 'split(",") \| map(gsub("^\\s+\|\\s+$"; ""))')
	echo "models=$models_json" >> $GITHUB_OUTPUT
	fi

	benchmark:
	name: ${{ matrix.model }}
	needs: determine-models
	strategy:
	matrix:
	model: ${{ fromJSON(needs.determine-models.outputs.models) }}
	fail-fast: false
	uses: ./.github/workflows/terminal-bench.yml
Check failure on line 41 in .github/workflows/nightly-terminal-bench.yml View workflow run for this annotation GitHub Actions / .github/workflows/nightly-terminal-bench.yml Invalid workflow file `error parsing called workflow ".github/workflows/nightly-terminal-bench.yml" -> "./.github/workflows/terminal-bench.yml" (source branch with sha:cd58afa7e63dcee794016e75d297e64216edabae) : (Line: 134, Col: 17): Unrecognized function: 'replace'. Located at position 22 within expression: inputs.model_name && replace(format('{0}-{1}', inputs.model_name, github.run_id), ':', '-') \|\| format('{0}', github.run_id)`
	with:
	model_name: ${{ matrix.model }}
	thinking_level: 'high'
	dataset: 'terminal-bench-core==0.1.1'
	concurrency: '4'
	livestream: true
	secrets:
	ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
	OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Nightly Terminal-Bench #2

Workflow file

Nightly Terminal-Bench #2

Uh oh!

Jobs

Run details

Workflow file for this run

GitHub Actions / .github/workflows/nightly-terminal-bench.yml