feat(responses)!: improve responses + conversations implementations #3810
Conversation
This PR updates the Conversation item related types and improves a couple of critical parts of the implementation:

- It creates a streaming output item for the final assistant message output by the model (as sketched below). Until now we only added content parts and included that message in the final response.
- It rewrites the conversation update code completely to account for items other than messages (tool calls, outputs, etc.).
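For context on the first change: the OpenAI Responses streaming protocol brackets each output item with `response.output_item.added` and `response.output_item.done` events around the content-part deltas. A minimal sketch of that shape, using plain dicts rather than this repo's actual event types (the function and its arguments are illustrative, not the PR's code):

```python
from collections.abc import AsyncIterator
from typing import Any


async def stream_final_message(
    message_id: str, text_chunks: AsyncIterator[str]
) -> AsyncIterator[dict[str, Any]]:
    """Wrap the assistant's final message in output-item events, not just content parts."""
    item: dict[str, Any] = {"type": "message", "role": "assistant", "id": message_id, "content": []}
    # Announce the output item before its content starts streaming.
    yield {"type": "response.output_item.added", "item": item}

    text = ""
    async for chunk in text_chunks:
        text += chunk
        yield {"type": "response.output_text.delta", "item_id": message_id, "delta": chunk}

    item["content"] = [{"type": "output_text", "text": text}]
    # Emit the completed item so consumers (and conversation storage) see the full message.
    yield {"type": "response.output_item.done", "item": item}
```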
(Two review threads on llama_stack/providers/inline/agents/meta_reference/responses/openai_responses.py were marked outdated and resolved.)
Force-pushed from af111e9 to 85f7ec0.
```python
    | OpenAIResponseOutputMessageMCPCall
    # Fallback to the generic message type as a last resort
    | OpenAIResponseMessage,
```
This is similar to the change I'm proposing at #3385, but there I'm directly adding OpenAIResponseOutput as a possible input, to always keep both lists in sync (based on a comment I received in that PR).

As the other PR is simpler/shorter, perhaps we can go with it and rebase this on top.
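A minimal sketch of that approach, with stand-in pydantic models instead of the real llama-stack definitions (the real unions have many more members):

```python
from typing import Annotated, Literal

from pydantic import BaseModel, Field


class OpenAIResponseMessage(BaseModel):  # stand-in for the real model
    type: Literal["message"] = "message"
    role: str
    content: str


class OpenAIResponseOutputMessageMCPCall(BaseModel):  # stand-in for the real model
    type: Literal["mcp_call"] = "mcp_call"
    name: str
    arguments: str


# The output union; in llama-stack this has many more members.
OpenAIResponseOutput = Annotated[
    OpenAIResponseOutputMessageMCPCall | OpenAIResponseMessage,
    Field(discriminator="type"),
]

# The #3385 idea: define input in terms of the output union, so any type added
# to the outputs is automatically accepted as input and the lists never drift.
OpenAIResponseInput = OpenAIResponseOutput
```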
@luis5tb It is actually far more complex; even my changes are only a patch. We need to do a thorough review of everything carefully. My changes here were not motivated by trying to override your PR but came from an independent motivation. It's unfortunate that your PR was left languishing for so long. I also still need to run tests.
@luis5tb in fact I went your route first but then looked at OpenAI's definitions. They are very subtly different! It may be that you are right, but we need an automated, thorough check of all the Responses type definitions -- there are way too many holes there right now.
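Such an automated check could diff field sets between the mirrored definitions and the upstream ones. A rough sketch with stand-in models (the real check would import both sets of actual types rather than define them inline):

```python
from pydantic import BaseModel


class TheirMessage(BaseModel):  # stand-in for an upstream OpenAI type
    id: str
    role: str
    content: str
    status: str


class OurMessage(BaseModel):  # stand-in for the local mirror of it
    id: str
    role: str
    content: str


def missing_fields(ours: type[BaseModel], theirs: type[BaseModel]) -> set[str]:
    """Report fields the upstream type declares that our mirror lacks."""
    return set(theirs.model_fields) - set(ours.model_fields)


assert missing_fields(OurMessage, TheirMessage) == {"status"}
```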
@github-actions run precommit

⏳ Running pre-commit hooks on PR #3810...
Did a pass and nothing caught my eye. Thanks!
✅ Pre-commit hooks completed successfully! 🔧 Changes have been committed and pushed to the PR branch.
```python
    | OpenAIResponseOutputMessageFunctionToolCall
    | OpenAIResponseOutputMessageFileSearchToolCall
    | OpenAIResponseOutputMessageWebSearchToolCall
    | OpenAIResponseOutputMessageFileSearchToolCall
```
nice, I was planning to follow up on these after the latest changes. thanks!
franciscojavierarceo left a comment:
🚀
@leseb wtf how did that pre-commit magic work, man! This is in my forked repo!
Force-pushed from 442399f to 57b3d14.
Still fixing some responses tests. They aren't broken by this change, honestly, but still (since these tests are not in CI yet)...
Note that CI will appear red on this PR because llamastack/llama-stack-client-python#281 needs to be landed concurrently.
```python
    messages = await convert_response_input_to_chat_messages(input)
else:
    # Use stored messages directly and convert only new input
    messages = stored_messages or []
```
can stored_messages actually be None here?
```python
    filter(lambda x: not isinstance(x, OpenAISystemMessageParam), orchestrator.final_messages)
)
if store:
    # TODO: we really should work off of output_items instead of "final_messages"
```
does this still apply?
followup on #3810

Signed-off-by: Sébastien Han <seb@redhat.com>
# What does this PR do?

Introduces two fixes to enhance the stability of the Responses API when dealing with tool-calling responses and structured outputs.

### Changes Made

1. It added OpenAIResponseOutputMessageMCPCall and ListTools to OpenAIResponseInput, but #3810 got merged and did the same in a different way. Still, this PR does it in a way that keeps OpenAIResponseOutput and the objects allowed in OpenAIResponseInput in sync.
2. Add protection in case self.ctx.response_format does not have a `type` attribute (see the sketch below).

BREAKING CHANGE: OpenAIResponseInput now uses the OpenAIResponseOutput union type. This is semantically equivalent: all previously accepted types are still supported via the OpenAIResponseOutput union. This improves type consistency and maintainability.
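The second fix amounts to a defensive attribute read. A minimal sketch, assuming a context object shaped like the orchestrator's `self.ctx` (the helper name is hypothetical):

```python
from typing import Any


def response_format_type(ctx: Any) -> str | None:
    """Safely read ctx.response_format.type; either attribute may be missing."""
    response_format = getattr(ctx, "response_format", None)
    return getattr(response_format, "type", None)
```

Structured-output handling can then be gated on, e.g., `response_format_type(self.ctx) == "json_schema"` instead of dereferencing the attribute chain directly.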
Test Plan
Used the test script from llamastack/llama-stack-client-python#281 for this