Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -46,6 +46,8 @@ HTR for the same tokens with page to page congruence, and broadly line by line c
|
|
46 |
* Ability to accurately identify and delete HTR artefacts (produced by non-textual data on original scanned image)
|
47 |
* Ability to insert text at an insertion mark recorded in the HTR output text, selecting the text to inset
|
48 |
from the line above or below the line containing the insertion mark
|
|
|
|
|
49 |
|
50 |
4. Explore the ability to use a fine-tuned domain specific small LLM to control post-HTR cleanup process steps
|
51 |
* Process Step One: Run rule-based Python script to expand abbreviations and contractions
|
|
|
46 |
* Ability to accurately identify and delete HTR artefacts (produced by non-textual data on original scanned image)
|
47 |
* Ability to insert text at an insertion mark recorded in the HTR output text, selecting the text to inset
|
48 |
from the line above or below the line containing the insertion mark
|
49 |
+
* Ability to identify structural components of a legal deposition (front matter; section headings; numbered articles
|
50 |
+
in allegations; numbered positions in libels; signatures)
|
51 |
|
52 |
4. Explore the ability to use a fine-tuned domain specific small LLM to control post-HTR cleanup process steps
|
53 |
* Process Step One: Run rule-based Python script to expand abbreviations and contractions
|