
docs: Add evolved architecture diagram with three-phase pipeline #970

Merged
abrichr merged 2 commits into main from feature/evolved-architecture-diagram
Jan 17, 2026

abrichr commented Jan 17, 2026

Summary

  • Adds a comprehensive evolved architecture diagram to README.md and docs/architecture-evolution.md
  • Modernizes terminology to align with GUI agent research literature
  • Shows implementation status with solid (implemented) vs dashed (future) styling

Key Changes

Three-Phase Pipeline

  1. DEMONSTRATE (Observation Collection) - capture, privacy
  2. LEARN (Policy Acquisition) - retrieval, ML, fine-tuning
  3. EXECUTE (Agent Deployment) - grounding, evals, safety gate

Core Concepts Visualized

  • Demo-conditioned prompting: The core innovation - "show, don't tell"
  • Policy/Grounding separation: Policy decides what, Grounding determines where
  • Safety Gate: Runtime validation layer before action execution
  • Multi-source data ingestion: Human demos, synthetic data, benchmarks
  • Evaluation-driven feedback loops: Success traces become new training data
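The Policy/Grounding separation and the Safety Gate can be illustrated with a minimal sketch. Everything named here (`Intent`, `Action`, `policy`, `ground`, `safety_gate`, `BLOCKED_TARGETS`) is hypothetical and not taken from the repository; the toy policy and the blocklist rule only stand in for a model call and a real risk assessment.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intent:
    """Policy output: WHAT to do, with no screen coordinates."""
    verb: str
    target: str  # human-readable description of the UI element


@dataclass(frozen=True)
class Action:
    """Grounded action: the same intent plus WHERE to perform it."""
    verb: str
    x: int
    y: int


def policy(goal: str) -> Intent:
    """Toy policy (assumption): a real one would query a model."""
    if "save" in goal.lower():
        return Intent("click", "Save button")
    return Intent("click", "Delete button")


def ground(intent: Intent, elements: dict[str, tuple[int, int]]) -> Action:
    """Grounding: look up the target's coordinates on the current screen."""
    x, y = elements[intent.target]
    return Action(intent.verb, x, y)


# Hypothetical risk rule: targets that must never be acted on automatically.
BLOCKED_TARGETS = {"Delete button"}


def safety_gate(intent: Intent, action: Action) -> Optional[Action]:
    """Runtime validation before execution: None means the action is blocked."""
    if intent.target in BLOCKED_TARGETS:
        return None
    return action


def step(goal: str, elements: dict[str, tuple[int, int]]) -> Optional[Action]:
    """One pass: policy decides, grounding locates, the gate validates."""
    intent = policy(goal)
    action = ground(intent, elements)
    return safety_gate(intent, action)


SCREEN = {"Save button": (120, 40), "Delete button": (200, 40)}
print(step("Save the document", SCREEN))  # passes the gate
print(step("Remove the file", SCREEN))    # blocked by the gate
```

Keeping `policy` free of coordinates means the grounding step can be swapped (for example, a different element detector) without touching the decision logic.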

The Abstraction Ladder

Literal -> Symbolic -> Template -> Semantic -> Goal

Progressive generalization from raw events to goal-level automation.
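One way to picture the ladder is as an ordered enumeration. The sketch below is illustrative only: `AbstractionLevel`, `generalize`, and especially the per-level example strings are assumptions made for this note, not definitions from the PR or the codebase.

```python
from enum import IntEnum


class AbstractionLevel(IntEnum):
    """Rungs of the ladder, ordered from raw events to goal-level automation."""
    LITERAL = 1
    SYMBOLIC = 2
    TEMPLATE = 3
    SEMANTIC = 4
    GOAL = 5


# Hypothetical examples of how one step might look at each rung.
EXAMPLES = {
    AbstractionLevel.LITERAL: "click at (412, 310)",
    AbstractionLevel.SYMBOLIC: "click the 'Save' button",
    AbstractionLevel.TEMPLATE: "click the '{button_name}' button",
    AbstractionLevel.SEMANTIC: "save the current document",
    AbstractionLevel.GOAL: "make sure the report is saved",
}


def generalize(level: AbstractionLevel) -> AbstractionLevel:
    """Move one rung up the ladder; the top rung maps to itself."""
    return AbstractionLevel(min(level + 1, AbstractionLevel.GOAL))


for level in AbstractionLevel:
    print(f"{level.name:<9} {EXAMPLES[level]}")
```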

Retrieval in BOTH Training AND Evaluation

The diagram shows retrieval supplying demos in both training and evaluation, and notes that demo conditioning improves accuracy from 33% to 100% on first-action benchmarks.
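As a rough sketch of what demo conditioning means at the prompt level, the function below prepends a retrieved demonstration to the task before asking for the first action. The function name `build_prompt`, the prompt wording, and the example task are all assumptions; the actual prompt format used by the project is not shown in this PR.

```python
from typing import Optional, Sequence


def build_prompt(task: str, demo_steps: Optional[Sequence[str]] = None) -> str:
    """Build a first-action prompt, optionally conditioned on a demonstration."""
    lines = []
    if demo_steps:
        # "Show, don't tell": include the steps of a similar recorded demo.
        lines.append("Here is a demonstration of a similar task:")
        lines.extend(f"{i}. {step}" for i, step in enumerate(demo_steps, start=1))
        lines.append("")
    lines.append(f"Task: {task}")
    lines.append("What is the first action?")
    return "\n".join(lines)


# Hypothetical task and demo, for illustration only.
zero_shot = build_prompt("Turn off Night Shift")
conditioned = build_prompt(
    "Turn off Night Shift",
    ["Open System Settings", "Click Displays", "Toggle Night Shift"],
)
print(conditioned)
```

The same builder can serve both paths named in the heading: at training time to format examples, and at evaluation time to format the query sent to the model.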

Test plan

  • Verify Mermaid diagrams render correctly on GitHub
  • Confirm terminology aligns with docs/architecture-evolution.md
  • Check all links work properly

Generated with Claude Code

abrichr and others added 2 commits January 16, 2026 23:46
…lution.md

- Three-phase pipeline: DEMONSTRATE, LEARN, EXECUTE with modernized terminology
- Demo-conditioned prompting as core innovation (show, don't tell)
- Policy/Grounding separation in EXECUTE phase
- Safety Gate as runtime layer with validation and risk assessment
- Multi-source data ingestion (human demos, synthetic data, benchmarks)
- Evaluation-driven feedback loops (success traces become training data)
- Abstraction Ladder visualization (Literal -> Symbolic -> Template -> Semantic -> Goal)
- Retrieval used in BOTH training AND evaluation for demo conditioning
- Solid lines = implemented, dashed = future

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

- Generated 1568x2132 PNG from Mermaid diagram
- Shows three-phase pipeline (DEMONSTRATE → LEARN → EXECUTE)
- Includes model layer and data flow connections

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

abrichr commented Jan 17, 2026

Rendered Architecture Diagram

[Image: Architecture Diagram]

Three-Phase Pipeline:

  1. DEMONSTRATE - Capture human demos, apply privacy, store in demo library
  2. LEARN - Retrieval path (embed→index→search) + Training path (load→train→checkpoint)
  3. EXECUTE - Observe→Policy→Ground→Act with Safety Gate and Evaluation
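The retrieval path in the LEARN phase (embed→index→search) can be sketched in a few lines. This is a toy under stated assumptions: `embed` uses bag-of-words counts as a stand-in for a real embedding model, and `DemoIndex`, the demo IDs, and the descriptions are hypothetical names invented for this example. The training path is not sketched here.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class DemoIndex:
    """In-memory demo library: stores (demo_id, embedding) pairs."""

    def __init__(self) -> None:
        self._items: list[tuple[str, Counter]] = []

    def add(self, demo_id: str, description: str) -> None:
        """Embed a demo description and add it to the index."""
        self._items.append((demo_id, embed(description)))

    def search(self, query: str, k: int = 1) -> list[str]:
        """Return the IDs of the k demos most similar to the query."""
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [demo_id for demo_id, _ in ranked[:k]]


index = DemoIndex()
index.add("demo-settings", "turn off night shift in display settings")
index.add("demo-email", "send an email with an attachment")
print(index.search("disable night shift"))
```

A linear scan is enough for a small demo library; a larger one would likely want an approximate nearest-neighbor index instead.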

The Mermaid source is in the README for GitHub rendering, and this PNG is for documentation/offline use.

abrichr merged commit b4c7024 into main on Jan 17, 2026
6 checks passed