You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+30Lines changed: 30 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,6 +5,36 @@ SPDX-License-Identifier: MIT-0
5
5
6
6
## [Unreleased]
7
7
8
+
## [0.4.5]
9
+
10
+
### Added
11
+
12
+
-**Document Split Classification Metrics for Evaluating Page-Level Classification and Document Segmentation**
13
+
- Added `DocSplitClassificationMetrics` class for comprehensive evaluation of document splitting and classification accuracy
14
+
-**Three Accuracy Types**: Page-level classification accuracy, split accuracy without order consideration, and split accuracy with exact page order matching
-**Automatic Integration**: Integrates with evaluation service when ground truth and predicted sections are available
17
+
-**Documentation**: Guide in `lib/idp_common_pkg/idp_common/evaluation/README.md` with usage examples, metric explanations, and best practices
18
+
19
+
### Fixed
20
+
21
+
-**Evaluation Output URI Fields Lost Across All Patterns - causing (a) missing Page Text Confidence content in UI, (2) failed Assessment step when reprocessing document after editing classes (No module named 'fitz')**
22
+
- Fixed bug where `text_confidence_uri` was being set to null in evaluation output for all three patterns
23
+
- Root cause: AppSync service `_appsync_to_document()` method incorrectly mapped page URIs, and evaluation functions overwrote correct documents with corrupted AppSync responses
24
+
25
+
-**UI: Metering Data Not Displayed During Document Processing**
Copy file name to clipboardExpand all lines: README.md
+4-2Lines changed: 4 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -34,6 +34,8 @@ White-glove customization, deployment, and integration support for production us
34
34
35
35
**Prefer AWS CDK?** This solution is also available as [GenAI IDP Accelerator for AWS CDK](https://github.com/cdklabs/genai-idp), providing the same functional capabilities through AWS CDK constructs for customers who prefer Infrastructure-as-Code with CDK.
36
36
37
+
**Prefer Terraform?** This solution is also available as [GenAI IDP Terraform](https://github.com/awslabs/genai-idp-terraform), providing the same functional capabilities as a Terraform module that integrates with existing infrastructure and supports customization through module variables.
38
+
37
39
## Key Features
38
40
39
41
-**Serverless Architecture**: Built entirely on AWS serverless technologies including Lambda, Step Functions, SQS, and DynamoDB
Copy file name to clipboardExpand all lines: docs/evaluation.md
+193Lines changed: 193 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -649,6 +649,132 @@ This multi-level analysis helps identify specific areas for improvement, such as
649
649
- Performance degradation with larger transaction lists
650
650
- Specific list item attributes that frequently fail evaluation
651
651
652
+
## Document Split Classification Metrics
653
+
654
+
In addition to extraction accuracy evaluation, the framework now includes document split classification metrics to assess how accurately documents are classified and split into sections. This provides a comprehensive evaluation of both **what** was extracted and **how** documents were classified and organized.
655
+
656
+
### Overview
657
+
658
+
Document split classification metrics evaluate three key aspects:
659
+
660
+
1. **Page-Level Classification**: Accuracy of classifying individual pages
661
+
2. **Document Split Grouping**: Accuracy of grouping pages into sections
662
+
3. **Page Order Preservation**: Accuracy of maintaining correct page order within sections
663
+
664
+
These metrics are calculated by comparing the `document_class` and `split_document.page_indices` fields in each section's result JSON.
665
+
666
+
### Three Types of Accuracy
667
+
668
+
#### 1. Page Level Accuracy
669
+
**Purpose**: Measures how accurately individual pages are classified, regardless of how they're grouped into sections.
670
+
671
+
**Calculation**: For each page index across all sections, compare the expected `document_class` with the predicted `document_class`.
672
+
673
+
**Use Case**: Identify if pages are being assigned to the correct document types.
0 commit comments