Implement queue locking mechanism in StreamingResponse to prevent race conditions during activity processing by cleemullins · Pull Request #189 · microsoft/Agents-for-python

cleemullins · 2025-10-20T22:57:17Z

Fixes #188

Fixed Concurrency Errors in `streaming_response.py`

I've identified and fixed several concurrency issues in the streaming response implementation:

Issues Found and Fixed

1. Added `asyncio.Lock` for queue synchronization

Added self._queue_lock in __init__ to protect shared state access
This lock prevents race conditions when multiple async contexts access the queue simultaneously

2. Fixed race condition in `_chunk_queued` flag

Problem: The flag was reset at the beginning of create_activity(), which could cause race conditions
Solution: Moved self._chunk_queued = False to execute after activity creation in all code paths
This ensures the flag is properly reset before returning, preventing lost updates

3. Protected `_queue_sync` task management

Problem: Checking and starting the drain task without synchronization could lead to multiple drain tasks running
Solution:
- Changed _queue_activity() to use a lock when checking and starting the drain task
- Added a check for done() status to handle completed tasks correctly
- This prevents multiple drain tasks from running simultaneously

4. Thread-safe queue operations

Problem: self._queue.pop(0) could be accessed by multiple async contexts
Solution:
- Protected queue access with the lock in _drain_queue()
- Added a double-check pattern to safely handle empty queue scenarios
- Ensures the _queue_sync cleanup happens under lock protection

Key Changes Made

Location	Change
Line 54	Added `self._queue_lock = asyncio.Lock()`
Lines 273-318	Refactored `_queue_next_chunk()` to reset `_chunk_queued` after activity creation
Lines 320-332	Refactored `_queue_activity()` to use lock-protected task startup
Lines 334-362	Protected queue access in `_drain_queue()` with lock

Impact

These changes eliminate race conditions that could occur when:

Multiple async contexts try to queue activities simultaneously
The drain task is checked and started from different contexts
Queue items are being added while being consumed
The _chunk_queued flag is being read/written from different contexts

The implementation is now thread-safe for async operations and prevents potential data corruption or lost updates.

…e conditions during activity processing

Copilot

Pull Request Overview

This PR implements a queue locking mechanism to address race conditions in the StreamingResponse class's activity processing. The changes introduce an asyncio.Lock to protect shared state during concurrent access and refactor queue management logic to ensure thread-safe operations.

Key Changes:

Added asyncio.Lock to synchronize access to the activity queue and related state
Refactored _chunk_queued flag reset logic to occur after activity creation rather than at the beginning
Modified queue drain task management to use lock-protected startup and completion checks

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-20T22:58:37Z