Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,9 @@ and economic history.
|
|
13 |
|
14 |
RESEARCH FOCUS
|
15 |
|
16 |
-
Broad objective
|
|
|
|
|
17 |
the machine transcription of English High Court of Admiralty depositions. We have both Raw HTR output and human corrected
|
18 |
HTR for the same tokens with page to page congruence, and broadly line by line congruence.
|
19 |
|
@@ -69,6 +71,8 @@ HTR for the same tokens with page to page congruence, and broadly line by line c
|
|
69 |
8. User testing of readability of raw HTR and different levels of machine and hand correction
|
70 |
* Impact on readability of raw HRT + rules based Python script optimised to domain
|
71 |
* Impact on readability of raw HTR + rules based Python script optimised to domain + different categories of fin-tuned small LLM machine adjustment
|
|
|
|
|
72 |
|
73 |
DATASETS
|
74 |
|
|
|
13 |
|
14 |
RESEARCH FOCUS
|
15 |
|
16 |
+
**Broad objective**
|
17 |
+
|
18 |
+
Explore the potential for small LLMs to support the process of cleaning Raw HTR output after
|
19 |
the machine transcription of English High Court of Admiralty depositions. We have both Raw HTR output and human corrected
|
20 |
HTR for the same tokens with page to page congruence, and broadly line by line congruence.
|
21 |
|
|
|
71 |
8. User testing of readability of raw HTR and different levels of machine and hand correction
|
72 |
* Impact on readability of raw HRT + rules based Python script optimised to domain
|
73 |
* Impact on readability of raw HTR + rules based Python script optimised to domain + different categories of fin-tuned small LLM machine adjustment
|
74 |
+
|
75 |
+
**Integration of small LLMs with RAG pipeline**
|
76 |
|
77 |
DATASETS
|
78 |
|