Skip to content

Commit 093f65a

Browse files
committed
Added limitations of Text-Based Holistic Classification in the classification.md file
1 parent f6e42aa commit 093f65a

File tree

1 file changed

+9
-0
lines changed

1 file changed

+9
-0
lines changed

docs/classification.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -67,6 +67,15 @@ classification:
6767
</document-text>
6868
```
6969
70+
## Limitations of Text-Based Holistic Classification
71+
72+
Despite its strengths in handling full-document context, this method has several limitations:
73+
74+
- **Context Limitations**: Passing the full document text to the model can exceed the context window, especially for long documents. This restricts use to models that support large context sizes.
75+
- **Hallucination Risk**: When processing lengthy inputs, the model may generate inaccurate or inconsistent classifications due to diluted focus across pages.
76+
- **Model Dependency**: Requires high-context models such as Amazon Nova Premier supports up to 1 million tokens. Smaller models are not suitable for processing long document packages effectively.
77+
- **Scalability Challenges**: Not ideal for very large or visually complex document sets. In such cases, the Multi-Modal Page-Level Classification method is more appropriate.
78+
7079
#### MultiModal Page-Level Classification with Few-Shot Examples
7180
7281
- Classifies each page independently using both text and image data

0 commit comments

Comments
 (0)