Skip to content

Commit ed537f8

Browse files
committed
Merge branch 'develop' into 'feat/multimodal_page_boundary_detection'
# Conflicts: # .gitignore
2 parents a7e63d1 + 7908d48 commit ed537f8

File tree

42 files changed

+1396
-97
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

42 files changed

+1396
-97
lines changed

CHANGELOG.md

Lines changed: 23 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,14 +5,35 @@ SPDX-License-Identifier: MIT-0
55

66
## [Unreleased]
77

8+
## [0.3.10]
9+
10+
### Added
11+
12+
- **Agent Analysis Feature for Natural Language Document Analytics**
13+
- Added integrated AI-powered analytics agent that enables natural language querying of processed document data
14+
- **Key Capabilities**: Convert natural language questions to SQL queries, generate interactive visualizations and tables, explore database schema automatically
15+
- **Secure Architecture**: All Python code execution happens in isolated AWS Bedrock AgentCore sandboxes, not in Lambda functions
16+
- **Multi-Tool Agent System**: Database discovery tool for schema exploration, Athena query tool for SQL execution, secure code sandbox for data transfer, Python visualization tool for charts and tables
17+
- **Example Use Cases**: Query document processing volumes and trends, analyze confidence scores and extraction accuracy, explore document classifications and content patterns, generate custom charts and data tables
18+
- **Sample W2 Test Data**: Includes 20 synthetic W2 tax documents for testing analytics capabilities
19+
- **Configurable Models**: Supports multiple AI models including Claude 3.7 Sonnet (default), Claude 3.5 Sonnet, Nova Pro/Lite, and Haiku
20+
- **Web UI Integration**: Accessible through "Document Analytics" section with real-time progress display and query history
21+
22+
- **Automatic Glue Table Creation for Document Sections**
23+
- Added automatic creation of AWS Glue tables for each document section type (classification) during processing
24+
- Tables are created dynamically when new section types are encountered, eliminating manual table creation
25+
- Consistent lowercase naming convention for tables ensures compatibility with case-sensitive S3 paths
26+
- Tables are configured with partition projection for efficient date-based queries without manual partition management
27+
- Automatic schema evolution - tables update when new fields are detected in extraction results
28+
29+
## [0.3.9]
30+
831
### Added
932
- **Optional Permissions Boundary Support for Enterprise Deployments**
1033
- Added `PermissionsBoundaryArn` parameter to all CloudFormation templates for organizations with Service Control Policies (SCPs) requiring permissions boundaries
1134
- Comprehensive support for both explicit IAM roles and implicit roles created by AWS SAM functions and statemachines`
1235
- Conditional implementation ensures backward compatibility - when no permissions boundary is provided, roles deploy normally
1336

14-
## [0.3.8]
15-
1637
### Added
1738
- IDP Configuration and Prompting Best Practices documentation [doc](./docs/idp-configuration-best-practices.md)
1839

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.3.10-wip
1+
0.3.10

config_library/pattern-1/lending-package-sample/config.yaml

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -194,3 +194,13 @@ pricing:
194194
price: '1.5E-6'
195195
- name: cacheWriteInputTokens
196196
price: '1.875E-5'
197+
- name: bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0
198+
units:
199+
- name: inputTokens
200+
price: '1.5E-5'
201+
- name: outputTokens
202+
price: '7.5E-5'
203+
- name: cacheReadInputTokens
204+
price: '1.5E-6'
205+
- name: cacheWriteInputTokens
206+
price: '1.875E-5'

config_library/pattern-2/lending-package-sample/config.yaml

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1425,3 +1425,13 @@ pricing:
14251425
price: '1.5E-6'
14261426
- name: cacheWriteInputTokens
14271427
price: '1.875E-5'
1428+
- name: bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0
1429+
units:
1430+
- name: inputTokens
1431+
price: '1.5E-5'
1432+
- name: outputTokens
1433+
price: '7.5E-5'
1434+
- name: cacheReadInputTokens
1435+
price: '1.5E-6'
1436+
- name: cacheWriteInputTokens
1437+
price: '1.875E-5'

config_library/pattern-2/rvl-cdip-package-sample/config.yaml

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -891,3 +891,13 @@ pricing:
891891
price: '1.5E-6'
892892
- name: cacheWriteInputTokens
893893
price: '1.875E-5'
894+
- name: bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0
895+
units:
896+
- name: inputTokens
897+
price: '1.5E-5'
898+
- name: outputTokens
899+
price: '7.5E-5'
900+
- name: cacheReadInputTokens
901+
price: '1.5E-6'
902+
- name: cacheWriteInputTokens
903+
price: '1.875E-5'

config_library/pattern-3/rvl-cdip-package-sample/config.yaml

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -751,3 +751,13 @@ pricing:
751751
price: '1.5E-6'
752752
- name: cacheWriteInputTokens
753753
price: '1.875E-5'
754+
- name: bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0
755+
units:
756+
- name: inputTokens
757+
price: '1.5E-5'
758+
- name: outputTokens
759+
price: '7.5E-5'
760+
- name: cacheReadInputTokens
761+
price: '1.5E-6'
762+
- name: cacheWriteInputTokens
763+
price: '1.875E-5'

docs/agent-analysis.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,10 @@ The Agent Analysis feature provides intelligent data exploration capabilities th
1515
- **Real-time Analysis**: Get insights from your processed documents without manual data analysis
1616
- **Secure Code Execution**: Python visualization code runs in isolated AWS Bedrock AgentCore sandboxes
1717

18+
19+
https://github.com/user-attachments/assets/e2dea2c5-5eb1-42f6-9af5-469afd2135a7
20+
21+
1822
## Key Features
1923

2024
- **Multi-Modal AI Agent**: Uses advanced language models (Claude 3.7 Sonnet by default) for intelligent query understanding

docs/architecture.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,8 @@ The solution uses a modular architecture with nested CloudFormation stacks to su
4141
- Pattern-specific optimizations and configurations
4242
- Optional features that can be enabled across all patterns:
4343
- Document summarization (controlled by `IsSummarizationEnabled` parameter)
44+
- This feature also enables the "Chat with Document" functionality
45+
- This feature does not use the Bedrock Knowledge Base but stores a full-text text file in S3
4446
- Document Knowledge Base (using Amazon Bedrock)
4547
- Automated accuracy evaluation against baseline data
4648

docs/idp-configuration-best-practices.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1475,6 +1475,7 @@ Configure from these supported models:
14751475
- `us.anthropic.claude-3-7-sonnet-20250219-v1:0`
14761476
- `us.anthropic.claude-sonnet-4-20250514-v1:0`
14771477
- `us.anthropic.claude-opus-4-20250514-v1:0`
1478+
- `us.anthropic.claude-opus-4-1-20250805-v1:0`
14781479

14791480
#### When to Configure Bedrock OCR
14801481

docs/pattern-2.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -257,6 +257,7 @@ To use Bedrock OCR:
257257
- `us.anthropic.claude-3-7-sonnet-20250219-v1:0`
258258
- `us.anthropic.claude-sonnet-4-20250514-v1:0`
259259
- `us.anthropic.claude-opus-4-20250514-v1:0`
260+
- `us.anthropic.claude-opus-4-1-20250805-v1:0`
260261
3. **Configure prompts**: Customize system and task prompts for your specific use case
261262
4. **Deploy**: The configuration can be updated through the Web UI without stack redeployment
262263

0 commit comments

Comments
 (0)