Conversation

@vladimirivic vladimirivic commented Jan 26, 2025


Summary:
We want to use the `Accept` header to negotiate content.

Sending this header in every request causes the server to return chunked events, even without the `stream=True` param.

```
llama-stack-client inference chat-completion --message="Hello there"

{"event":{"event_type":"start","delta":"Hello"}}

{"event":{"event_type":"progress","delta":"!"}}

{"event":{"event_type":"progress","delta":" How"}}

{"event":{"event_type":"progress","delta":" are"}}

{"event":{"event_type":"progress","delta":" you"}}

{"event":{"event_type":"progress","delta":" today"}}
```
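The chunks above can be reassembled client-side by concatenating the `delta` fields. A minimal sketch (the chunk strings are copied verbatim from the output above):

```python
import json

# Each streamed line is a JSON object with an event carrying a text delta;
# joining the deltas in order reconstructs the full assistant message.
chunks = [
    '{"event":{"event_type":"start","delta":"Hello"}}',
    '{"event":{"event_type":"progress","delta":"!"}}',
    '{"event":{"event_type":"progress","delta":" How"}}',
    '{"event":{"event_type":"progress","delta":" are"}}',
    '{"event":{"event_type":"progress","delta":" you"}}',
    '{"event":{"event_type":"progress","delta":" today"}}',
]
message = "".join(json.loads(c)["event"]["delta"] for c in chunks)
```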

Test Plan:

```
pip install .

llama-stack-client configure --endpoint={endpoint} --api-key={api-key}

llama-stack-client inference chat-completion --message="Hello there"
ChatCompletionResponse(completion_message=CompletionMessage(content='Hello! How can I assist you today?', role='assistant', stop_reason='end_of_turn', tool_calls=[]), logprobs=None)
```
```python
        timeout: float | httpx.Timeout | None | NotGiven = NOT_GIVEN,
    ) -> InferenceChatCompletionResponse | Stream[InferenceChatCompletionResponse]:
        extra_headers = {"Accept": "text/event-stream", **(extra_headers or {})}
        if stream is True:
```
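The header merge on the `extra_headers` line follows a common dict-spread pattern: the default `Accept` is listed first, so a caller-supplied value wins. A minimal sketch of just that merge (the `with_sse_accept` helper name is illustrative, not part of the client):

```python
# Sketch of the extra_headers merge from the diff above: the default
# Accept comes first, and the caller's headers are spread after it,
# so an explicit caller-provided Accept overrides the default.
def with_sse_accept(extra_headers=None):
    return {"Accept": "text/event-stream", **(extra_headers or {})}
```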

this should be `if stream`, but the higher-level issue is that this is generated code. We need to make sure we always auto-apply this patch after generation (see stainless_sync.sh), or find another way.
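For context on the `if stream` suggestion: in generated clients of this style, `stream` can be `True`, `False`, or a `NotGiven` sentinel, and `if stream is True` matches only the literal `True`. A minimal sketch (this `NotGiven` is a stand-in, assuming the real sentinel is falsy, as is conventional in such clients):

```python
# Stand-in sentinel: assumed falsy, like the NOT_GIVEN used in the diff.
class NotGiven:
    def __bool__(self):
        return False

NOT_GIVEN = NotGiven()

def is_streaming(stream):
    # `if stream:` semantics — covers True, and rejects False and the sentinel.
    return bool(stream)
```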

@yanxi0830

Is this still needed after #108?


ashwinb commented Jan 31, 2025

@yanxi0830 yes, this is still needed, but for a different reason: this header is sent by the client to the server, not the other way round.
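Server-side, the negotiation described in the summary amounts to checking the request's `Accept` header in addition to the `stream` param. A minimal sketch (the `should_stream` helper is hypothetical, not actual server code):

```python
# Hypothetical server-side check: stream when the client either passed
# stream=True or announced it accepts SSE via the Accept header.
def should_stream(headers, stream_param=False):
    return bool(stream_param) or headers.get("Accept") == "text/event-stream"
```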

@vladimirivic vladimirivic deleted the pr98 branch February 1, 2025 02:06