Nightly Terminal-Bench #48

Triggered via schedule December 12, 2025 00:04
Status Success
Total duration 1h 12m 37s
Artifacts 2
Determine models to test 3s
Matrix: benchmark

Artifacts

Produced during runtime
Name Size Digest
terminal-bench-results-anthropic-claude-sonnet-4-5-20151476908 6.62 MB sha256:0d5a526c936f6071bfdffd5f30a8a05b04954c00f30cbc34f73c4a1c9d36029e
terminal-bench-results-openai-gpt-5.1-codex-20151476908 5.44 MB sha256:f7b2bc9b67a0adf00be82047eb9dd1c7d742c9b5cf003fc67fb1b083f966e2f7