Feature/tracer #113

AlexanderOnischenko · 2026-01-23T16:00:29Z

No description provided.

…tures-from-events.jsonl Guard planner_input dependency when emitting replay_point requires

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

…tures-and-migrate-paths Clarify fixture help defaults in Makefile

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

…tures-and-migrate-paths Codex/reorganize fixtures and migrate paths

AlexanderOnischenko · 2026-01-23T16:01:09Z

@codex, please review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d9076ec05a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-23T16:08:13Z

tests/test_replay_fixtures.py

+        bucket_dir = FIXTURES_ROOT / bucket
+        if not bucket_dir.exists():
+            continue
+        for path in bucket_dir.rglob("*.json"):
+            paths.append((bucket, path))


Ignore resource JSONs when collecting replay fixtures

When a replay fixture is exported with --with-requires, export_replay_fixture/write_bundle copies dependent resource files into the same output directory, and those resources can be .json files. _iter_fixture_paths currently glob/rglob’s every *.json under tests/fixtures/replay_points/{fixed,known_bad}, so resource payloads get parameterized as fixtures and then test_replay_fixture fails at assert event.get("type") == "replay_point" because resource files don’t contain a replay payload. This breaks tests as soon as any fixture includes JSON resources; consider filtering by type in _iter_fixture_paths or storing resources in a subdirectory that isn’t globbed.

Useful? React with 👍 / 👎.

…er-v2-features-and-export Tracer: observed-first `replay_case` v2, bundle export/runtime, CLI, and remove generated fixtures

AlexanderOnischenko · 2026-01-26T04:34:40Z

@codex, please review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: be0ea0f3e2

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-26T04:40:25Z

examples/demo_qa/runner.py

+    events_path = run_dir / "events.jsonl"
+    case_logger = event_logger.for_case(case.id, events_path) if event_logger else EventLogger(events_path, run_id, case.id)


Respect --events off when creating case logger

Because run_one always instantiates an EventLogger when event_logger is None, per-case events.jsonl emission becomes unconditional. That means batch.handle_batch’s --events off path (which intentionally passes event_logger=None) no longer disables event logging, and the agent still records events to disk; this can unexpectedly write large trace files or capture data that users explicitly opted out of logging. Consider only creating a case logger when event_logger is provided (or add an explicit flag) so the CLI semantics for --events still work as advertised.

Useful? React with 👍 / 👎.

…ing-with-events-off Respect events flag when creating case loggers

This reverts commit 2685c7dc10e1586321b13073d8a586b631aab156.

…een-to-ensure-validation fixture-green: validate by default, add --expected-from, rollback on mismatch with improved diffs

* Allow explicit disabling of event logging * Refine event logger sentinel typing

* Fix relational selector validation * Add replay validator checks for fixtures * Use shared replay validators in known_bad tests * Require replay validators in fixture-green and fixed tests

AlexanderOnischenko · 2026-01-26T17:47:24Z

@codex, please review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7091ab385c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-26T18:12:37Z

src/fetchgraph/planning/normalize/plan_normalizer.py

+                log_replay_case(
+                    replay_logger,
+                    id="plan_normalize.spec_v1",
+                    meta={
+                        "provider": spec.provider,


Include planner_input extra in replay_case requires

The new replay handler falls back to ctx.extras["planner_input_v1"] when provider_info_snapshot is missing (see src/fetchgraph/replay/handlers/plan_normalize.py), but this log_replay_case call never populates requires, so export_replay_case_bundle will not carry the planner_input_v1 extra (or schema resource) into the case bundle. That means replays for providers not present in the catalog/registry will silently use a minimal ProviderInfo and can normalize selectors differently from the original run. Consider adding a requires entry for planner_input_v1 (and schema resource if needed) when emitting the replay_case.

Useful? React with 👍 / 👎.

* Improve replay requires resolution for plan normalization * Refine replay dependency heuristics

AlexanderOnischenko · 2026-01-26T19:51:44Z

@codex, please review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 45eb6bef44

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-26T19:59:55Z

src/fetchgraph/tracer/cli.py

+def _resolve_case_dir_from_run_id(*, data_dir: Path, runs_subdir: str, run_id: str, case_id: str) -> Path:
+    runs_root = data_dir / runs_subdir / run_id / "cases"
+    if not runs_root.exists():
+        raise FileNotFoundError(f"Run directory does not exist: {runs_root}")
+    case_dirs = sorted(runs_root.glob(f"{case_id}_*"), key=lambda p: p.stat().st_mtime, reverse=True)


Resolve run_id using run metadata, not directory name

The --run-id path resolution assumes the run_id is the directory name by building data_dir/<runs_subdir>/<run_id>/cases, but batch runs create directories named like <timestamp>_<cases_stem> while storing the real run_id only in run_meta.json/events, so --run-id will raise “Run directory does not exist” even when a valid run_id is supplied from metadata. Consider scanning run directories for a matching run_meta.json run_id (or clarifying that --run-id expects the run folder name) to make this flag work for batch runs.

Useful? React with 👍 / 👎.

…cs (#131) * Resolve run ids from history * Fix run and case dir resolution * Tighten events/run-dir resolve output * Clarify events run_dir requirements

AlexanderOnischenko · 2026-01-27T03:20:54Z

@codex, please review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d3bf518b65

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-27T03:27:47Z

src/fetchgraph/tracer/fixture_tools.py

+            rel = _safe_resource_path(file_name, stem=stem)
+            if rel.parts[:3] == ("resources", stem, resource_id):
+                continue
+            if rel.parts[:1] == ("resources",) and len(rel.parts) >= 2:
+                rel_tail = Path(*rel.parts[2:])
+            else:


Strip resource_id when migrating resource paths

When a bundle’s data_ref.file already includes a resource-id segment (e.g., resources/<old_stem>/<resource_id>/file.txt), the current migration logic only drops the first two path parts and then re-prefixes resource_id, producing resources/<stem>/<resource_id>/<resource_id>/file.txt. That yields a non-canonical layout and updates data_ref.file to the doubled path, so resources end up in an unexpected location and tooling that assumes resources/<stem>/<resource_id>/... (per tracer docs) won’t find them. Consider checking rel.parts[2] == resource_id and dropping three parts in that case.

Useful? React with 👍 / 👎.

AlexanderOnischenko and others added 28 commits January 23, 2026 12:03

Guard replay requires for planner input

acd1380

Make ALL flag boolean in fixture target

6c6c7b5

Fix replay plan normalize imports

509fbe4

Allow module-level skip for replay fixtures

e60e312

Merge pull request #107 from AlexanderOnischenko/codex/add-replay-fix…

b232f1a

…tures-from-events.jsonl Guard planner_input dependency when emitting replay_point requires

version inc - tracer

d73c690

Clarify fixture help defaults

809897d

Fix replay fixture typing

7e1c909

Validate replay fixtures for relational selectors

2d46fa2

Add fixture management commands

5408b02

Decouple fixture commands from check

17fc8ea

Fix fixture tools git paths and case ids

35c825e

Remove unused replay bucket helper

a76f976

Improve replay fixture debug output

b3eba81

новые трейсы с падениями (перемещаем)

344b301

Precheck trace conflicts and rollback fixture moves

c5bcb45

Merge branch 'codex/reorganize-fixtures-and-migrate-paths' of github.…

e1058c2

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

Improve fixture tool discovery

6b4cef0

Merge branch 'codex/reorganize-fixtures-and-migrate-paths' of github.…

868774a

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

Use git mv for tracked rollbacks

456618d

Allow unbucketed replay fixtures

921de37

Report promoted fixture paths

fd18fb4

Merge branch 'codex/reorganize-fixtures-and-migrate-paths' of github.…

55fda76

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

Fix rerun hint for root fixtures

d50d110

Drop zip plan-trace loading

a6da9e3

Merge pull request #108 from AlexanderOnischenko/codex/reorganize-fix…

5c2a437

…tures-and-migrate-paths Clarify fixture help defaults in Makefile

Merge branch 'codex/reorganize-fixtures-and-migrate-paths' of github.…

b0d8f70

…com:AlexanderOnischenko/fetchgraph into codex/reorganize-fixtures-and-migrate-paths

Merge pull request #112 from AlexanderOnischenko/codex/reorganize-fix…

d9076ec

…tures-and-migrate-paths Codex/reorganize fixtures and migrate paths

chatgpt-codex-connector bot reviewed Jan 23, 2026

View reviewed changes

Merge pull request #118 from AlexanderOnischenko/codex/implement-trac…

be0ea0f

…er-v2-features-and-export Tracer: observed-first `replay_case` v2, bundle export/runtime, CLI, and remove generated fixtures

chatgpt-codex-connector bot reviewed Jan 26, 2026

View reviewed changes

AlexanderOnischenko added 7 commits January 26, 2026 08:17

Respect events flag for case logging

c91a472

Merge pull request #121 from AlexanderOnischenko/codex/fix-event-logg…

764aa87

…ing-with-events-off Respect events flag when creating case loggers

Update fixture-green validation and diagnostics

bc96fac

Fix fixture-green observed hints

0ac1dc7

Deduplicate diff path helper

e248500

Guard fixture-rm path usage for BUCKET=all

4ed2a7b

Revert "Guard fixture-rm path usage for BUCKET=all"

0ba8f5b

This reverts commit 2685c7dc10e1586321b13073d8a586b631aab156.

AlexanderOnischenko force-pushed the feature/tracer branch from 764aa87 to be0ea0f Compare January 26, 2026 06:31

AlexanderOnischenko and others added 8 commits January 26, 2026 09:39

Expose diff utils and adjust fixture-rm heuristic

bd409af

Add fixture-green name heuristic

86c2b3b

Merge pull request #123 from AlexanderOnischenko/codex/fix-fixture-gr…

aaaadb7

…een-to-ensure-validation fixture-green: validate by default, add --expected-from, rollback on mismatch with improved diffs

Default-enable event logging; allow explicit disable via None (#126)

8129cfd

* Allow explicit disabling of event logging * Refine event logger sentinel typing

Fix relational selector validation (#125)

28a483d

* Fix relational selector validation * Add replay validator checks for fixtures * Use shared replay validators in known_bad tests * Require replay validators in fixture-green and fixed tests

Add validator for resource_read replay (#127)

c3b6935

обновлена документация

6ab3c51

Fix type narrowing for replay utilities (#128)

7091ab3

chatgpt-codex-connector bot reviewed Jan 26, 2026

View reviewed changes

Improve plan_normalize replay dependency handling (#129)

45eb6be

* Improve replay requires resolution for plan normalization * Refine replay dependency heuristics

chatgpt-codex-connector bot reviewed Jan 26, 2026

View reviewed changes

AlexanderOnischenko added 2 commits January 27, 2026 01:46

Resolve tracer --run-id from history/run_meta and improve CLI semanti…

4ce2bab

…cs (#131) * Resolve run ids from history * Fix run and case dir resolution * Tighten events/run-dir resolve output * Clarify events run_dir requirements

Fix optional path handling in tracer (#132)

d3bf518

chatgpt-codex-connector bot reviewed Jan 27, 2026

View reviewed changes

AlexanderOnischenko added 2 commits January 28, 2026 22:57

сделал проходящие тесты в корзине known_bad зелеными

2507210

форматирование вывода

7b5597d

		events_path = run_dir / "events.jsonl"
		case_logger = event_logger.for_case(case.id, events_path) if event_logger else EventLogger(events_path, run_id, case.id)

Feature/tracer #113

Are you sure you want to change the base?

Feature/tracer #113

Uh oh!

Conversation

AlexanderOnischenko commented Jan 23, 2026

Uh oh!

AlexanderOnischenko commented Jan 23, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

AlexanderOnischenko commented Jan 26, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

AlexanderOnischenko commented Jan 26, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

AlexanderOnischenko commented Jan 26, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

AlexanderOnischenko commented Jan 27, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants