Commit 9a79fd3

Author: Bob Strahan
optimize: update assessment configuration for cost efficiency and performance
1 parent 1532e27 commit 9a79fd3

12 files changed (+19, -19 lines changed)

CHANGELOG.md

Lines changed: 1 addition & 1 deletion
@@ -49,7 +49,7 @@ SPDX-License-Identifier: MIT-0
  - Maintains backward compatibility for stack updates by keeping the parameter value "default" mapped to the rvl-cdip-sample for pattern-2.

  - **Reduce assessment step costs**
- - Default model for granular assessment is now `us.amazon.nova-pro-v1:0`
+ - Default model for granular assessment is now `us.amazon.nova-lite-v1:0` - experimentation recommended
  - Improved placement of <<CACHEPOINT>> tags in assessment prompt to improve utilization of prompt caching

  ### Fixed
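
The changelog entry above mentions the placement of `<<CACHEPOINT>>` tags. As a rough illustration of why placement matters, the sketch below splits a prompt template at the tag, so that the text before it (identical across requests) can be cached and only the text after it varies. The helper `split_at_cachepoint` and the sample template are hypothetical and are not code from this repository.

```python
from typing import Tuple

CACHEPOINT = "<<CACHEPOINT>>"


def split_at_cachepoint(template: str) -> Tuple[str, str]:
    """Return (cacheable_prefix, variable_suffix) for a prompt template.

    Hypothetical helper: text before the first tag is treated as the
    cacheable part; a template without the tag has no cacheable prefix.
    """
    prefix, tag, suffix = template.partition(CACHEPOINT)
    if not tag:
        return "", template
    return prefix, suffix


# Illustrative template (not the repository's actual assessment prompt).
template = "Instructions and document text.<<CACHEPOINT>>Attributes to assess: {names}"
prefix, suffix = split_at_cachepoint(template)
print(len(prefix), "characters precede the cache point")
```

Under this reading, moving more of the static prompt text ahead of the tag makes the cacheable prefix longer.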

config_library/pattern-2/bank-statement-sample/config.yaml

Lines changed: 1 addition & 1 deletion
@@ -376,7 +376,7 @@ assessment:
  max_workers: "20"
  simple_batch_size: "3"
  list_batch_size: "1"
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'
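
The threshold is stored as a quoted string in these configs, so a consumer has to convert it before comparing. The sketch below shows one way that lowering the default from 0.9 to 0.8 could change which attributes get flagged. `flag_low_confidence` and the sample scores are hypothetical; how the solution actually applies the threshold is not shown in this commit.

```python
from typing import Dict, List


def flag_low_confidence(scores: Dict[str, float], threshold: str) -> List[str]:
    """Return the names of attributes whose confidence is below the threshold.

    `threshold` arrives as a string (e.g. '0.8'), matching the YAML above.
    """
    limit = float(threshold)
    return sorted(name for name, score in scores.items() if score < limit)


# Made-up confidence scores, for illustration only.
scores = {"account_number": 0.95, "statement_date": 0.85, "closing_balance": 0.72}

print(flag_low_confidence(scores, "0.9"))  # old default
print(flag_low_confidence(scores, "0.8"))  # new default
```

With these sample scores, the 0.9 threshold flags two attributes and the 0.8 threshold flags one.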

config_library/pattern-2/lending-package-sample/config.yaml

Lines changed: 6 additions & 6 deletions
@@ -915,8 +915,8 @@ classes:
  attributeType: group
  classification:
  image:
- target_height: '1268'
- target_width: '951'
+ target_height: ''
+ target_width: ''
  top_p: '0.1'
  max_tokens: '4096'
  top_k: '5'
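
In this hunk `target_height` and `target_width` change from fixed values to empty strings. The comment in the `docs/assessment.md` hunk of this commit describes an empty string as "no resizing". A minimal sketch of how a loader might interpret those values follows. `resolve_target_size` is a hypothetical helper, and treating a partially specified pair as "no resizing" is an assumption.

```python
from typing import Optional, Tuple


def resolve_target_size(image_cfg: dict) -> Optional[Tuple[int, int]]:
    """Return (width, height) in pixels, or None when resizing is disabled."""
    width = str(image_cfg.get("target_width") or "").strip()
    height = str(image_cfg.get("target_height") or "").strip()
    if not width or not height:
        # Assumption: both values are required; otherwise keep the original size.
        return None
    return int(width), int(height)


print(resolve_target_size({"target_width": "951", "target_height": "1268"}))  # old values
print(resolve_target_size({"target_width": "", "target_height": ""}))  # new values
```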
@@ -1144,19 +1144,19 @@ summarization:
  You are a document summarization expert who can analyze and summarize documents from various domains including medical, financial, legal, and general business documents. Your task is to create a summary that captures the key information, main points, and important details from the document. Your output must be in valid JSON format. \nSummarization Style: Balanced\\nCreate a balanced summary that provides a moderate level of detail. Include the main points and key supporting information, while maintaining the document's overall structure. Aim for a comprehensive yet concise summary.\n Your output MUST be in valid JSON format with markdown content. You MUST strictly adhere to the output format specified in the instructions.
  assessment:
  image:
- target_height: '1268'
- target_width: '951'
+ target_height: ''
+ target_width: ''
  granular:
  enabled: true
  max_workers: "20"
  simple_batch_size: "3"
  list_batch_size: "1"
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'
  temperature: '0.0'
- model: us.amazon.nova-pro-v1:0
+ model: us.amazon.nova-lite-v1:0
  system_prompt: >-
  You are a document analysis assessment expert. Your task is to evaluate the confidence of extraction results by analyzing the source document evidence. Respond only with JSON containing confidence scores for each extracted attribute.
  task_prompt: >-

config_library/pattern-2/rvl-cdip-package-sample-with-few-shot-examples/config.yaml

Lines changed: 2 additions & 2 deletions
@@ -894,12 +894,12 @@ assessment:
  max_workers: "20"
  simple_batch_size: "3"
  list_batch_size: "1"
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'
  temperature: '0.0'
- model: us.amazon.nova-pro-v1:0
+ model: us.amazon.nova-lite-v1:0
  system_prompt: >-
  You are a document analysis assessment expert. Your task is to evaluate the confidence of extraction results by analyzing the source document evidence. Respond only with JSON containing confidence scores for each extracted attribute.
  task_prompt: >-

config_library/pattern-2/rvl-cdip-package-sample/config.yaml

Lines changed: 2 additions & 2 deletions
@@ -616,12 +616,12 @@ assessment:
  max_workers: "20"
  simple_batch_size: "3"
  list_batch_size: "1"
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'
  temperature: '0.0'
- model: us.amazon.nova-pro-v1:0
+ model: us.amazon.nova-lite-v1:0
  system_prompt: >-
  You are a document analysis assessment expert. Your task is to evaluate the confidence of extraction results by analyzing the source document evidence. Respond only with JSON containing confidence scores for each extracted attribute.
  task_prompt: >-

config_library/pattern-3/rvl-cdip-package-sample/config.yaml

Lines changed: 1 addition & 1 deletion
@@ -476,7 +476,7 @@ assessment:
  max_workers: "20"
  simple_batch_size: "3"
  list_batch_size: "1"
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'

docs/assessment.md

Lines changed: 1 addition & 1 deletion
@@ -542,7 +542,7 @@ The assessment service supports configurable image dimensions for optimal confid

  ```yaml
  assessment:
- model: "us.amazon.nova-pro-v1:0"
+ model: "us.amazon.nova-lite-v1:0"
  # Image processing settings - preserves original resolution
  image:
  target_width: "" # Empty string = no resizing (recommended)

lib/idp_common_pkg/idp_common/assessment/README_GRANULAR.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@ The granular service uses the same `task_prompt` template as the original servic
7272
```yaml
7373
assessment:
7474
# Standard assessment configuration
75-
default_confidence_threshold: '0.9'
75+
default_confidence_threshold: '0.8'
7676
model: us.anthropic.claude-3-7-sonnet-20250219-v1:0
7777
system_prompt: "Your assessment system prompt..."
7878
task_prompt: "Your assessment task prompt with <<CACHEPOINT>>..."

notebooks/examples/config/assessment.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
  # Assessment Service Configuration
  # This configuration works with both original and granular assessment approaches
  assessment:
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'

notebooks/examples/config/assessment_granular.yaml

Lines changed: 1 addition & 1 deletion
@@ -3,7 +3,7 @@
  # accuracy and scalability when assessing document extraction confidence.

  assessment:
- default_confidence_threshold: '0.9'
+ default_confidence_threshold: '0.8'
  top_p: '0.1'
  max_tokens: '10000'
  top_k: '5'
