-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Description
Problem (one or two sentences)
Users need the ability to toggle prompt caching when using AWS Bedrock.
The toggle should default to ON/checked and should be visible on the UI.
If feasible - Inference Profiles should also have an option in the settings.
https://aws.amazon.com/bedrock/cost-optimization/#:~:text=Use%20prompt%20caching%20to%20reduce,repeated%20prompt%20prefixes%20between%20requests.
https://aws.amazon.com/blogs/machine-learning/track-allocate-and-manage-your-generative-ai-cost-and-usage-with-amazon-bedrock/
Note:
This feature may already be partially implemented (#8670 (comment)) but users are unable to see the benefits
Context (who is affected and when)
Any user using AWS Bedrock would benefit from significant cost and latency reductions.
And have the confidence to know that prompt caching is on while having an escape hatch in case they need to turn it off
Desired behavior (conceptual, not technical)
In the providers settings when AWS Bedrock is chosen the option for prompt caching should be with the other checkboxes.
Of course, you should verify that prompt caching is working behind the scenes
Constraints / preferences (optional)
Ensure it works with most common AWS Bedrock setups, custom ARN, and govcloud.
Request checklist
- I've searched existing Issues and Discussions for duplicates
- This describes a specific problem with clear context and impact
Roo Code Task Links (optional)
No response
Acceptance criteria (optional)
No response
Proposed approach (optional)
No response
Trade-offs / risks (optional)
No response
Metadata
Metadata
Assignees
Labels
Type
Projects
Status