Addaci commited on
Commit
acd1ebb
Β·
verified Β·
1 Parent(s): cda1c6d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -17,7 +17,7 @@ pinned: false
17
 
18
  <div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
19
 
20
- # πŸ”¬ **1.0 Research Focus**
21
 
22
  ## **1.1 Fine-tuning Small LLMs**
23
 
@@ -62,11 +62,19 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
62
  # πŸ“š **2.0 Datasets**
63
 
64
  ## **2.1 Published Datasets**
 
 
 
65
  1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
66
  2. [MarineLives/Latin-Expansions](https://huggingface.co/datasets/MarineLives/Latin-Expansions)
67
  3. [MarineLives/Line-Insertions](https://huggingface.co/datasets/MarineLives/Line-Insertions)
68
  4. [MarineLives/HCA-1358-Errors-In-Phrases](https://huggingface.co/datasets/MarineLives/HCA-1358-Errors-In-Phrases)
69
  5. [MarineLives/HCA-13-58-TEXT](https://huggingface.co/datasets/MarineLives/HCA-13-58-TEXT)
 
 
 
 
 
70
 
71
  ## **2.2 Unpublished Datasets**
72
  - **Dataset 1**: 420K tokens, full diplomatic transcription (1627–1660)
@@ -80,6 +88,6 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
80
  # 🌍 **Explore MarineLives**
81
  Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!
82
  You can follow us on BlueSky at [@marinelives.bsky.social](https://bsky.app/profile/marinelives.bsky.social)
83
- You can explore our content on our [MarineLives wiki](http://www.marinelives.org/wiki/MarineLives) and on our [MarineLives Transkribus site]
84
 
85
  </div>
 
17
 
18
  <div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
19
 
20
+ # πŸ”¬ **1.0 Research Focus on Hugging Face**
21
 
22
  ## **1.1 Fine-tuning Small LLMs**
23
 
 
62
  # πŸ“š **2.0 Datasets**
63
 
64
  ## **2.1 Published Datasets**
65
+
66
+ ### **ENGLISH HIGH COURT OF ADMIRALTY DEPOSITIONS**
67
+
68
  1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
69
  2. [MarineLives/Latin-Expansions](https://huggingface.co/datasets/MarineLives/Latin-Expansions)
70
  3. [MarineLives/Line-Insertions](https://huggingface.co/datasets/MarineLives/Line-Insertions)
71
  4. [MarineLives/HCA-1358-Errors-In-Phrases](https://huggingface.co/datasets/MarineLives/HCA-1358-Errors-In-Phrases)
72
  5. [MarineLives/HCA-13-58-TEXT](https://huggingface.co/datasets/MarineLives/HCA-13-58-TEXT)
73
+
74
+ ### **YIDDISH LETTERS**
75
+
76
+ 1. [MarineLives/Gavin-yiddish-raw-HTR-and-groundtruth-lines](https://huggingface.co/datasets/MarineLives/Gavin_yiddish_raw_HT_and_groundtruth_lines)
77
+ 2. [MarineLives/Gavin-yiddish-raw-HTR-and-groundtruth-paragraphs](https://huggingface.co/datasets/MarineLives/Gavin_yiddish_raw_HTR_and_groundtruth_paragraphs)
78
 
79
  ## **2.2 Unpublished Datasets**
80
  - **Dataset 1**: 420K tokens, full diplomatic transcription (1627–1660)
 
88
  # 🌍 **Explore MarineLives**
89
  Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!
90
  You can follow us on BlueSky at [@marinelives.bsky.social](https://bsky.app/profile/marinelives.bsky.social)
91
+ You can explore our content on our [MarineLives wiki](http://www.marinelives.org/wiki/MarineLives) and on our [ai-and-history-collaboratory GitHub repository](https://github.com/Addaci/marinelives-collaboratory/wiki).
92
 
93
  </div>