Skip to content

Commit b403da2

Browse files
committed
Merge branch 'develop' of ssh.gitlab.aws.dev:genaiic-reusable-assets/engagement-artifacts/genaiic-idp-accelerator into develop
2 parents 1c2e439 + df09fde commit b403da2

File tree

40 files changed

+1731
-2440
lines changed

40 files changed

+1731
-2440
lines changed

CHANGELOG.md

Lines changed: 24 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,21 @@ SPDX-License-Identifier: MIT-0
55

66
## [Unreleased]
77

8+
### Changed
9+
10+
- Updated lending_package.pdf sample with more realistic driver's license image
11+
12+
### Added
13+
14+
## [0.3.8]
15+
816
### Added
917

18+
- **Lending Package Configuration Support for Pattern-2**
19+
- Added new `lending-package-sample` configuration to Pattern-2, providing comprehensive support for lending and financial document processing workflows
20+
- New default configuration for Pattern-2 stack deployments, optimized for loan applications, mortgage processing, and financial verification documents
21+
- Previous `rvl-cdip-sample` configuration remains available by selecting `rvl-cdip-package-sample` for the `Pattern2Configuration` parameter when deploying or updating stacks
22+
1023
- **Text Confidence View for Document Pages**
1124
- Added support for displaying OCR text confidence data through new `TextConfidenceUri` field
1225
- New "Text Confidence View" option in the UI pages panel alongside existing Markdown and Text views
@@ -20,28 +33,38 @@ SPDX-License-Identifier: MIT-0
2033
- Configurable through Web UI without requiring code changes or redeployment
2134

2235
### Changed
36+
2337
- **Converted text confidence data format from JSON to markdown table for improved readability and reduced token usage**
2438
- Removed unnecessary "page_count" field
2539
- Changed "text_blocks" array to "text" field containing a markdown table with Text and Confidence columns
2640
- Reduces prompt size for assessment service while improving UI readability
2741
- OCR confidence values now rounded to 1 decimal point (e.g., 99.1, 87.3) for cleaner display
2842
- Markdown table headers now explicitly left-aligned using `|:-----|:-----------|` format for consistent appearance
43+
2944
- **Simplified OCR Service Initialization**
3045
- OCR service now accepts a single `config` dictionary parameter for cleaner, more consistent API
3146
- Aligned with classification service pattern for better consistency across IDP services
3247
- Backward compatibility maintained - old parameter pattern still supported with deprecation warning
3348
- Updated all lambda functions and notebooks to use new simplified pattern
3449
- Removed fixed image target_height and target_width from default configurations, so images are processed in original resolution by default.
3550

51+
- **Updated Default Configuration for Pattern1 and Pattern2**
52+
- Changed default configuration for new stacks from "default" to "lending-package-sample" for both Pattern1 and Pattern2
53+
- Maintains backward compatibility for stack updates by keeping the parameter value "default" mapped to the rvl-cdip-sample for pattern-2.
54+
55+
- **Reduce assessment step costs**
56+
- Default model for granular assessment is now `us.amazon.nova-lite-v1:0` - experimentation recommended
57+
- Improved placement of <<CACHEPOINT>> tags in assessment prompt to improve utilization of prompt caching
3658

3759
### Fixed
60+
3861
- **Fixed Image Resizing Behavior for High-Resolution Documents**
3962
- Fixed issue where empty strings in image configuration were incorrectly resizing images to default 951x1268 pixels instead of preserving original resolution
4063
- Empty strings (`""`) in `target_width` and `target_height` configuration now preserve original document resolution for maximum processing accuracy
4164
- Fixed issue where PNG files were being unnecessarily converted to JPEG format and resized to lower resolution with lost quality
4265
- Fixed issue where PNG and JPG image files were not rendering inline in the Document Details page
4366
- Fixed issue where PDF files were being downloaded instead of displayed inline
44-
67+
- Fixed pricing data for cacheWrite tokens for Amazon Nova models to resolve innacurate cost estimation in UI.
4568

4669

4770
## [0.3.7]

README.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,10 @@ SPDX-License-Identifier: MIT-0
2323

2424
A scalable, serverless solution for automated document processing and information extraction using AWS services. This system combines OCR capabilities with generative AI to convert unstructured documents into structured data at scale.
2525

26+
https://github.com/user-attachments/assets/272b543b-e506-48ce-acc1-361422d22322
27+
28+
White-glove customization, deployment, and integration support for production use cases is also available through [AWS Professional Services](https://aws.amazon.com/professional-services/).
29+
2630
## Key Features
2731

2832
- **Serverless Architecture**: Built entirely on AWS serverless technologies including Lambda, Step Functions, SQS, and DynamoDB
@@ -73,8 +77,8 @@ After deployment, you can quickly process a document and view results:
7377
- **Via S3**: Upload directly to the S3 input bucket (find the bucket URL in CloudFormation stack Outputs)
7478

7579
2. **Use Sample Documents**:
76-
- For Pattern 1 (BDA): Use [samples/lending_package.pdf](./samples/lending_package.pdf)
77-
- For Patterns 2 and 3: Use [samples/rvl_cdip_package.pdf](./samples/rvl_cdip_package.pdf)
80+
- For Patterns 1 (BDA) and Pattern 2: Use [samples/lending_package.pdf](./samples/lending_package.pdf)
81+
- For Pattern 3 (UDOP): Use [samples/rvl_cdip_package.pdf](./samples/rvl_cdip_package.pdf)
7882

7983
3. **Monitor Processing**:
8084
- **Via Web UI**: Track document status on the dashboard
@@ -105,8 +109,8 @@ To update an existing GenAIIDP stack to a new version:
105109
7. For detailed instructions, see the [Deployment Guide](./docs/deployment.md#updating-an-existing-stack)
106110

107111
For testing, use these sample files:
108-
- Pattern-1 BDA default project: `samples/lending_package.pdf`
109-
- Patterns 2 and 3 default configurations: `samples/rvl_cdip_package.pdf`
112+
- For Patterns 1 (BDA) and Pattern 2: Use [samples/lending_package.pdf](./samples/lending_package.pdf)
113+
- For Pattern 3 (UDOP): Use [samples/rvl_cdip_package.pdf](./samples/rvl_cdip_package.pdf)
110114

111115
For detailed deployment and testing instructions, see the [Deployment Guide](./docs/deployment.md).
112116

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.3.8-wip6
1+
0.3.9-wip

config_library/pattern-1/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,4 +27,4 @@ See the main [README.md](../README.md) for more detailed instructions on creatin
2727

2828
## Available Configurations
2929

30-
Currently, only the default configuration is available for Pattern 1. Contributions are welcome!
30+
Currently, only the default lending-package-sample configuration is available for Pattern 1. Contributions are welcome!

config_library/pattern-1/default/README.md renamed to config_library/pattern-1/lending-package-sample/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
22
SPDX-License-Identifier: MIT-0
33

4-
# Default Configuration
4+
# Default Configuration (lending-package-sample)
55

66
This directory contains the default configuration for the GenAI IDP Accelerator. This configuration serves as the baseline for all document processing tasks and can be used as a starting point for creating custom configurations.
77

config_library/pattern-1/default/config.yaml renamed to config_library/pattern-1/lending-package-sample/config.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -121,7 +121,7 @@ pricing:
121121
- name: cacheReadInputTokens
122122
price: '1.5E-8'
123123
- name: cacheWriteInputTokens
124-
price: '0'
124+
price: '6.0E-8'
125125
- name: bedrock/us.amazon.nova-pro-v1:0
126126
units:
127127
- name: inputTokens
@@ -131,7 +131,7 @@ pricing:
131131
- name: cacheReadInputTokens
132132
price: '2.0E-7'
133133
- name: cacheWriteInputTokens
134-
price: '0'
134+
price: '8.0E-7'
135135
- name: bedrock/us.amazon.nova-premier-v1:0
136136
units:
137137
- name: inputTokens

config_library/pattern-1/default/samples/lending_package.pdf renamed to config_library/pattern-1/lending-package-sample/samples/lending_package.pdf

File renamed without changes.

config_library/pattern-2/bank-statement-sample/config.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -376,12 +376,12 @@ assessment:
376376
max_workers: "20"
377377
simple_batch_size: "3"
378378
list_batch_size: "1"
379-
default_confidence_threshold: '0.9'
379+
default_confidence_threshold: '0.8'
380380
top_p: '0.1'
381381
max_tokens: '10000'
382382
top_k: '5'
383383
temperature: '0.0'
384-
model: us.anthropic.claude-3-7-sonnet-20250219-v1:0
384+
model: us.amazon.nova-pro-v1:0
385385
system_prompt: >-
386386
You are a document analysis assessment expert. Your task is to evaluate the confidence of extraction results by analyzing the source document evidence. Respond only with JSON containing confidence scores for each extracted attribute.
387387
task_prompt: >-
@@ -579,7 +579,7 @@ pricing:
579579
- name: cacheReadInputTokens
580580
price: '1.5E-8'
581581
- name: cacheWriteInputTokens
582-
price: '0'
582+
price: '6.0E-8'
583583
- name: bedrock/us.amazon.nova-pro-v1:0
584584
units:
585585
- name: inputTokens
@@ -589,7 +589,7 @@ pricing:
589589
- name: cacheReadInputTokens
590590
price: '2.0E-7'
591591
- name: cacheWriteInputTokens
592-
price: '0'
592+
price: '8.0E-7'
593593
- name: bedrock/us.amazon.nova-premier-v1:0
594594
units:
595595
- name: inputTokens

0 commit comments

Comments
 (0)