You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Merge branch 'fix/cache-page-classes-on-throttle-failure' into 'develop'
add caching of successful page classes on thread (page) failure to classifier,...
See merge request genaiic-reusable-assets/engagement-artifacts/genaiic-idp-accelerator!153
-`ClassificationResult`: Overall result of a classification operation
227
228
-`Document`: Core document data model used throughout the IDP pipeline
228
229
230
+
## DynamoDB Caching for Resilient Classification
231
+
232
+
The classification service now supports optional DynamoDB caching to improve efficiency and resilience when processing documents with multiple pages. This feature addresses throttling scenarios where some pages succeed while others fail, avoiding the need to reclassify already successful pages on retry.
233
+
234
+
### How It Works
235
+
236
+
1.**Cache Check**: Before processing, the service checks for cached classification results for the document
237
+
2.**Selective Processing**: Only pages without cached results are classified
238
+
3.**Exception-Safe Caching**: Successful page results are cached even when other pages fail
239
+
4.**Retry Efficiency**: Subsequent retries only process previously failed pages
0 commit comments