Dynamic Few-Shot Prompting with RAG and LLM using Amazon S3 Vectors #149

base: main
Conversation
523ca3f to 5393dd5 (Compare)
rstrahan left a comment:
Hi Daniel - Thanks so much. I added a bunch of comments - please respond to each one, and we can meet/chat if it helps. Tx.
> ### Step 3: Populate the Examples Dataset
>
> Use the [fewshot_dataset_import.ipynb](notebooks/fewshot_dataset_import.ipynb) notebook to import a dataset into S3 Vectors, or manually upload your example documents and metadata to the S3 bucket and vector index created by the stack.
Looks like this is fcc_invoices_dataset_import.ipynb now
Should you add more guidance for users on how to prepare and load their own dataset?
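For illustration, a minimal sketch of what importing a custom example into the S3 Vectors index could look like, assuming the boto3 `s3vectors` put_vectors call and the Titan Multimodal Embeddings G1 request shape; the bucket/index names, key, metadata layout, and helper name are placeholders, not the sample's actual schema.

```python
# Hypothetical sketch: embed one labeled example page and store it in the S3 Vectors index.
# Model ID, bucket/index names, key, and metadata layout are illustrative placeholders.
import base64
import json

import boto3

bedrock = boto3.client("bedrock-runtime")
s3vectors = boto3.client("s3vectors")

def import_example(image_path, expected_attributes, vector_bucket, index_name, key):
    # Embed the example page image (Titan Multimodal Embeddings G1 accepts a base64 image)
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")
    response = bedrock.invoke_model(
        modelId="amazon.titan-embed-image-v1",
        body=json.dumps({"inputImage": image_b64}),
    )
    embedding = json.loads(response["body"].read())["embedding"]

    # Store the vector; the ground-truth attributes travel along as metadata
    s3vectors.put_vectors(
        vectorBucketName=vector_bucket,
        indexName=index_name,
        vectors=[{
            "key": key,
            "data": {"float32": embedding},
            "metadata": {"expected_attributes": json.dumps(expected_attributes)},
        }],
    )
```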
> ## Quick Start
>
> ### Step 1: Deploy the Dynamic-few shot Stack
State prerequisites, since the user must clone the repo and have the required tools (SAM CLI, AWS CLI) installed.
> ### Step 4: Configure IDP to Use Dynamic Few-Shot
>
> Add the Lambda ARN to your IDP extraction configuration:
Show how to configure the Lambda ARN in the UI (screenshot)
> custom_prompt_lambda_arn: "arn:aws:lambda:region:account:function:GENAIIDP-dynamic-few-shot"
>
> **Important**: Your extraction task prompt must include the `{FEW_SHOT_EXAMPLES}` placeholder where you want the dynamic examples to be inserted.
I think we need to be more prescriptive here. How do we make this easy / foolproof?
E.g., should all our extraction prompt examples contain the {FEW_SHOT_EXAMPLES} placeholder? Does it do no harm if there are no examples?
In the Lambda, I currently have:

`logger.warn("Missing {FEW_SHOT_EXAMPLES} placeholder in prompt template")`

What I could do is replace this with:

`raise ValueError("Missing {FEW_SHOT_EXAMPLES} placeholder in prompt template")`

This would stop the extraction process, and the error would propagate to the UI if {FEW_SHOT_EXAMPLES} is missing.

As for adding {FEW_SHOT_EXAMPLES} to all templates - yes, I think that could be done. It should not do any harm: if there are no examples available, the placeholder will just be empty.
No, I don't think we should enforce that the placeholder be there. I'm just suggesting that we should add it to the default extraction task_prompt in our configs.
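For reference, a minimal sketch of the placeholder handling being discussed in this thread; the helper name and the `strict` flag are illustrative, not the actual Lambda code.

```python
# Hypothetical sketch of the {FEW_SHOT_EXAMPLES} substitution discussed above.
import logging

logger = logging.getLogger(__name__)

PLACEHOLDER = "{FEW_SHOT_EXAMPLES}"

def build_prompt(task_prompt_template, examples_text, strict=False):
    """Insert retrieved examples into the extraction task prompt.

    If no examples were retrieved, the placeholder resolves to an empty
    string, so templates that always carry the placeholder are harmless.
    """
    if PLACEHOLDER not in task_prompt_template:
        if strict:
            # Option raised in the thread: fail fast so the error surfaces in the UI
            raise ValueError("Missing {FEW_SHOT_EXAMPLES} placeholder in prompt template")
        logger.warning("Missing {FEW_SHOT_EXAMPLES} placeholder in prompt template")
        return task_prompt_template
    return task_prompt_template.replace(PLACEHOLDER, examples_text or "")
```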
> reason: "Demo function - KMS CMK not required, but can be added by customer for production use cases"
> # checkov:skip=CKV_AWS_158: "Demo function - KMS CMK not required, but can be added by customer for production use cases"
> Properties:
>   VectorBucketName: !Ref VectorBucketName
See my comment above about the bucket name: we should use a customer-provided existing bucket (if provided) OR create a new bucket with a dynamically generated name.
When running `sam deploy --guided`, the customer can choose a name. This is then stored in `samconfig.toml` as a `parameter_overrides` key/value pair. The above name is only a suggestion presented to the user.
Right, but your template will always create a new bucket, and it will fail if the bucket name is already in use.
I'm suggesting that you allow the user to provide an existing bucket (with the index already in it) so that the few-shot example vector store can be shared by multiple IDP stacks, as a convenience. OR they can leave the bucket name blank to have the stack create one.
We should not require the user to enter a bucket name for a new bucket, IMHO; better to auto-generate the bucket name.
Understood, this is implemented now - the user can leave VectorBucketName, VectorIndexName and DatasetBucketName blank and the stack will auto-create them. If provided, the stack will re-use existing resources.
> extraction:
>   dynamic_few_shot_lambda_arn: "${DynamicFewShotFunction.Arn}"
> MonitoringLink:
Nice touch. A CloudWatch dashboard could be even better, to track the number of inferences, successful matches, throttles/retries, etc.
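As one possible way to feed such a dashboard, a minimal sketch of publishing custom CloudWatch metrics from the Lambda; the namespace and metric names are assumptions, not part of the PR.

```python
# Hypothetical sketch: publish per-invocation counters a dashboard could chart.
import boto3

cloudwatch = boto3.client("cloudwatch")

def publish_metrics(matches_found, retries):
    cloudwatch.put_metric_data(
        Namespace="GENAIIDP/DynamicFewShot",  # illustrative namespace
        MetricData=[
            {"MetricName": "Invocations", "Value": 1, "Unit": "Count"},
            {"MetricName": "SuccessfulMatches", "Value": matches_found, "Unit": "Count"},
            {"MetricName": "Retries", "Value": retries, "Unit": "Count"},
        ],
    )
```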
> def _s3vectors_find_similar_items_from_image(page_image):
>     """Search for similar items using image query"""
>     embedding = bedrock_client.generate_embedding(
This will incur token usage and costs. Can you minimally capture and log token use?
Even better, can you create a metering data structure and return it in the Lambda response, and enhance the extraction function to merge this metering into the overall metering? It could be a new 'context' (e.g. 'FewShotExamplesPlugin') - that way it will show up in the document Cost Estimates and Benchmark results for accurate cost transparency.
The invoke_model API (https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/bedrock-runtime/client/invoke_model.html) used for generating embeddings does not seem to return token usage.
An approach could be to run a second call to count_tokens (https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/bedrock-runtime/client/count_tokens.html) to retrieve the token usage. I will look into it.
- …n Nova Multimodal Embeddings
- …therwise bedrock client would always require PIL dependency

582b3cd to 2d630ad (Compare)
Issue #43
Description of changes:

- class `BedrockClient` to generate embeddings for Amazon Titan Multimodal Embeddings G1 and Amazon Nova Multimodal Embeddings

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.