Nightly Terminal-Bench #48
nightly-terminal-bench.yml
on: schedule
Determine models to test
3s
Matrix: benchmark
Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| terminal-bench-results-anthropic-claude-sonnet-4-5-20151476908 | 6.62 MB | sha256:0d5a526c936f6071bfdffd5f30a8a05b04954c00f30cbc34f73c4a1c9d36029e |
| terminal-bench-results-openai-gpt-5.1-codex-20151476908 | 5.44 MB | sha256:f7b2bc9b67a0adf00be82047eb9dd1c7d742c9b5cf003fc67fb1b083f966e2f7 |
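
The digests listed above can be checked against a downloaded artifact. The following is a minimal sketch, assuming the digest is computed over the downloaded archive file as a whole and that the listed value uses the `sha256:<hex>` form shown in the table. The helper names `sha256_digest` and `matches_listed_digest` and the local file path in the usage note are hypothetical.

```python
import hashlib


def sha256_digest(path, chunk_size=1 << 20):
    """Return the file's SHA-256 digest in the 'sha256:<hex>' form used in the table."""
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in fixed-size chunks so multi-megabyte archives are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return "sha256:" + hasher.hexdigest()


def matches_listed_digest(path, listed_digest):
    """Compare a local file against a digest string copied from the artifacts table."""
    return sha256_digest(path) == listed_digest.strip().lower()
```

For example, `matches_listed_digest("results.zip", "sha256:0d5a526c936f6071bfdffd5f30a8a05b04954c00f30cbc34f73c4a1c9d36029e")` would return `True` only if the hypothetical local file `results.zip` is byte-identical to the first artifact, under the assumption stated above.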