Addaci commited on
Commit
637ef4c
1 Parent(s): 2a7eb03

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -46,6 +46,8 @@ HTR for the same tokens with page to page congruence, and broadly line by line c
46
  * Ability to accurately identify and delete HTR artefacts (produced by non-textual data on original scanned image)
47
  * Ability to insert text at an insertion mark recorded in the HTR output text, selecting the text to inset
48
  from the line above or below the line containing the insertion mark
 
 
49
 
50
  4. Explore the ability to use a fine-tuned domain specific small LLM to control post-HTR cleanup process steps
51
  * Process Step One: Run rule-based Python script to expand abbreviations and contractions
 
46
  * Ability to accurately identify and delete HTR artefacts (produced by non-textual data on original scanned image)
47
  * Ability to insert text at an insertion mark recorded in the HTR output text, selecting the text to inset
48
  from the line above or below the line containing the insertion mark
49
+ * Ability to identify structural components of a legal deposition (front matter; section headings; numbered articles
50
+ in allegations; numbered positions in libels; signatures)
51
 
52
  4. Explore the ability to use a fine-tuned domain specific small LLM to control post-HTR cleanup process steps
53
  * Process Step One: Run rule-based Python script to expand abbreviations and contractions