fix!: Enhance response API support to not fail with tool calling #3385
Conversation
Force-pushed from ab48a50 to 7e080db
hi @luis5tb, would you mind elaborating a little on the exact issue you observed and how your PR addresses it?
Force-pushed from 751c019 to b7e3351
ashwinb left a comment:
as @jaideepr97 noted, the PR summary is not quite sufficient to justify the substantial changes, particularly to the API shape.
Thanks! Yes, I'll update the description with more information. The TL;DR is that when using the GPT-OSS model (with remote-vllm) I'm hitting a lot of issues related to the conversion/generation of messages between the vLLM, OpenAI, and LlamaStack formats. Regarding the API changes:
The other changes mostly handle cases that probably were not considered originally and may stem from vLLM's final implementation and the objects it returns. All of them are about ensuring the message format can be parsed by the different components (vLLM and LlamaStack). The underlying problem may be that LlamaStack translates Responses API calls to vLLM into chat_completion API calls, which makes the message parsing difficult.
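To illustrate the kind of translation involved, here is a hypothetical sketch of reshaping a Responses-style function-call item into a chat_completion assistant message. The function name and dict shapes are illustrative of the two public API formats, not the actual llama-stack implementation:

```python
def response_item_to_chat_message(item: dict) -> dict:
    """Convert a Responses API output item into an OpenAI
    chat-completions assistant message (illustrative sketch)."""
    if item.get("type") == "function_call":
        # A Responses 'function_call' item becomes an assistant
        # message carrying a tool_calls entry.
        return {
            "role": "assistant",
            "content": None,
            "tool_calls": [
                {
                    "id": item.get("call_id", ""),
                    "type": "function",
                    "function": {
                        "name": item.get("name", ""),
                        # arguments are a JSON string in both APIs
                        "arguments": item.get("arguments", "{}"),
                    },
                }
            ],
        }
    # Anything else maps to a plain assistant text message.
    return {"role": "assistant", "content": item.get("content", "")}
```

The mismatch the PR works around shows up when items like MCP tool calls or tool listings appear in the input history and no such mapping exists for them yet.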
Force-pushed from b7e3351 to 3233b5f
Force-pushed from 3233b5f to 042d118
Force-pushed from 042d118 to ab8beaf
@luis5tb Are you still planning to update the PR description?
Yes! I was testing which pieces are no longer needed due to changes in vLLM, and what else is needed due to the new changes there too. I'll update later today or first thing tomorrow morning.
Force-pushed from 64e53bf to d0b562f
Force-pushed from d0b562f to 4c41608
@jwm4 @ashwinb @jaideepr97 Let me know if the current description is enough or whether more/different details are needed.
Force-pushed from d33043f to 93d5894
Force-pushed from 93d5894 to 597dc2d
still seems to be failing integration tests though
@luis5tb since you're changing some datatypes you will need to update the api conformance test as well
Integration test fixed. Regarding the API conformance tests, I'm not entirely sure how to update/fix those. Can you give me some hints? I thought it was about running the script that regenerates the OpenAPI spec (run_openapi_generator.sh). Note this change only adds new supported input types without removing the existing ones, so there is no break in the API: what was accepted before is still accepted. The only difference is that a couple of additional types are accepted too.
Force-pushed from 597dc2d to a8508ad
I guess it is technically a breaking change in the spec since you're removing fields, even though you're replacing them with the parent object. The test seems to be directly comparing removals and additions, though I'm not sure why it also errors out on new additions.
@jaideepr97 these types of changes are ok, just prefix the title of the PR with
Thanks! I updated the PR with that prefix + breaking change information.
Force-pushed from 88a5b13 to 4389018
done
Force-pushed from 4389018 to 1296779
Force-pushed from 1296779 to 37a9d4f
ashwinb left a comment:
lgtm
What does this PR do?
Introduces two main fixes to enhance the stability of the Responses API when dealing with tool-calling responses and structured outputs.
Changes Made
It added OpenAIResponseOutputMessageMCPCall and ListTools to OpenAIResponseInput, but feat(responses)!: improve responses + conversations implementations #3810 got merged and did the same in a different way. Still, this PR does it in a way that keeps OpenAIResponseOutput and the allowed objects in OpenAIResponseInput in sync.
Add protection in case self.ctx.response_format does not have a type attribute.
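A minimal sketch of such a guard, assuming response_format may be None or an object without a `type` attribute (the helper name is hypothetical, not the actual llama-stack code):

```python
def resolve_response_format_type(response_format):
    """Return the declared response-format type, or None when the
    attribute is absent, instead of raising AttributeError."""
    # getattr with a default tolerates both None and objects that
    # simply do not define `type`.
    fmt_type = getattr(response_format, "type", None)
    if fmt_type is None:
        return None  # caller falls back to plain text generation
    return fmt_type
```

The point is that the structured-output path is only taken when the attribute is actually present.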
BREAKING CHANGE: OpenAIResponseInput now uses the OpenAIResponseOutput union type.
This is semantically equivalent: all previously accepted types are still supported
via the OpenAIResponseOutput union. The change improves type consistency and maintainability.
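A toy illustration of why reusing the output union keeps compatibility: the input union is defined in terms of the output union, so every item a response can emit remains accepted as input on the next turn. These are stand-in dataclasses, not the real llama-stack definitions:

```python
from dataclasses import dataclass
from typing import Union, get_args


@dataclass
class OpenAIResponseMessage:
    content: str


@dataclass
class OpenAIResponseOutputMessageMCPCall:
    server_label: str
    name: str


# Output union: everything the Responses API can emit.
OpenAIResponseOutput = Union[
    OpenAIResponseMessage,
    OpenAIResponseOutputMessageMCPCall,
]

# Input now reuses the output union, keeping the two in sync:
# adding a new output type automatically makes it valid input.
OpenAIResponseInput = OpenAIResponseOutput
```

With this shape there is nothing to forget when a new output item type is introduced, which is the sync property the PR description refers to.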