fix(langchain): capture usage on streamed gemini responses #1309

hassiebp · 2025-08-22T11:24:20Z

Important

Fixes usage parsing bug in CallbackHandler.py for streamed Gemini responses, ensuring valid usage data is captured.

Behavior:
- Fixes usage parsing bug in _parse_usage in CallbackHandler.py for streamed Gemini responses.
- Loop now breaks only when _parse_usage_model() returns non-None, ensuring valid usage data is found.
Tests:
- Marks test_basic_chat_openai in test_langchain.py as flaky.

This description was created by the ellipsis-dev bot for 15ff192. It will automatically update as commits are pushed.

Disclaimer: Experimental PR review

Greptile Summary

This PR fixes a bug in the Langchain integration's usage parsing logic for streamed Gemini responses. The change modifies the _parse_usage function in CallbackHandler.py to improve how it handles usage metadata extraction from generation chunks.

The core issue was in the loop that iterates through generation chunks looking for usage metadata. Previously, the function would break out of the loop immediately after finding any chunk with usage_metadata, regardless of whether that metadata was valid or usable. The fix adds a conditional check that only breaks the loop when _parse_usage_model() returns a non-None value, indicating that valid usage data was actually found.
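The difference between the two loop behaviors can be illustrated with a minimal, self-contained sketch. This is not the code from CallbackHandler.py: the chunk dictionaries, `parse_usage_model`, `find_usage_old`, and `find_usage_fixed` are hypothetical stand-ins chosen for illustration, and the assumption that an empty `usage_metadata` parses to `None` is taken from the description above.

```python
from typing import Any, Optional


def parse_usage_model(usage_metadata: Optional[dict[str, Any]]) -> Optional[dict[str, int]]:
    """Hypothetical stand-in for _parse_usage_model(): None means 'nothing usable'."""
    if not usage_metadata:
        return None
    # Keep only integer token counts; anything else is ignored in this sketch.
    usage = {
        key: value
        for key, value in usage_metadata.items()
        if isinstance(value, int) and not isinstance(value, bool)
    }
    return usage or None


def find_usage_old(chunks: list[dict[str, Any]]) -> Optional[dict[str, int]]:
    """Old behavior: stop at the first chunk that has a usage_metadata key."""
    usage = None
    for chunk in chunks:
        if "usage_metadata" in chunk:
            usage = parse_usage_model(chunk["usage_metadata"])
            break  # exits even when the parsed result is None
    return usage


def find_usage_fixed(chunks: list[dict[str, Any]]) -> Optional[dict[str, int]]:
    """Fixed behavior: keep scanning until a chunk yields parsed usage."""
    usage = None
    for chunk in chunks:
        if "usage_metadata" in chunk:
            usage = parse_usage_model(chunk["usage_metadata"])
            if usage is not None:
                break  # only stop once valid usage data was found
    return usage


# Illustrative stream: the first chunk carries empty metadata, the last one the real counts.
streamed_chunks = [
    {"usage_metadata": {}},
    {"text": "Hello"},
    {"usage_metadata": {"input_tokens": 12, "output_tokens": 5, "total_tokens": 17}},
]

old_result = find_usage_old(streamed_chunks)
fixed_result = find_usage_fixed(streamed_chunks)
```

With this input, the old loop stops on the empty first chunk and reports no usage, while the fixed loop continues to the final chunk and returns its token counts.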

This change matters specifically for Gemini streaming responses, where early chunks may contain empty or invalid usage metadata while later chunks contain the actual token usage information. The Langfuse Usage model tracks input/output tokens and associated costs, so accurate usage capture is critical for billing and monitoring. By continuing to search until it finds valid usage data, the fix enables proper token usage tracking for streamed Gemini responses in the Langchain integration.

Confidence score: 4/5

- This PR addresses a specific, well-defined bug with a minimal and focused change
- Score reflects the targeted nature of the fix and clear understanding of the problem being solved
- Pay close attention to the modified usage parsing logic in CallbackHandler.py

greptile-apps

1 file reviewed, no comments

rvndbalaji · 2025-08-26T05:08:14Z

Hi @hassiebp, I'm eagerly waiting for this to be merged. Any ETA on when this might be merged and released?

hassiebp · 2025-08-26T08:56:23Z

@rvndbalaji This is released in https://github.com/langfuse/langfuse-python/releases/tag/v3.3.1 👍🏾

fix(langchain): capture usage on streamed gemini responses

cd45de3

hassiebp enabled auto-merge (squash) August 22, 2025 11:24

hassiebp linked an issue Aug 22, 2025 that may be closed by this pull request

bug: Gemini 2.5 Token Usage NOT shown and shows 0 cost when streaming is enabled. Works fine when streaming is disabled langfuse/langfuse#8625

Closed

greptile-apps bot reviewed Aug 22, 2025

hassiebp added 2 commits August 26, 2025 10:02

push

15ff192

Merge branch 'main' into fix-langchain-gemini-streaming-usage

6f8b601

hassiebp disabled auto-merge August 26, 2025 08:47

hassiebp merged commit df5b72a into main Aug 26, 2025
8 of 10 checks passed

hassiebp deleted the fix-langchain-gemini-streaming-usage branch August 26, 2025 08:53

3 participants
