starride-teklia commited on
Commit
29a117e
1 Parent(s): 8098992
Files changed (1) hide show
  1. README.md +13 -20
README.md CHANGED
@@ -14,35 +14,28 @@ datasets:
14
  - Teklia/Alcar
15
  ---
16
 
17
- # HOME-Alcar and Himanis handwritten text recognition
18
 
19
- This model performs Handwritten Text Recognition in Latin.
20
 
21
  ## Model description
22
 
23
- The model has been trained using the PyLaia library on the [HOME-Alcar](https://zenodo.org/record/5600884) document images.
24
- The model was trained on images resized to a fixed height of 128 pixels, keeping the original aspect ratio.
 
25
 
26
- ## Evaluation results
27
-
28
- The model achieves the following results:
29
 
30
- Himanis:
31
-
32
- | set | CER (%) | WER (%) | support |
33
- | ----- | ---------- | --------- | --------- |
34
- | train | 5.31 | 17.47 | 18503 |
35
- | val | 10.37 | 27.63 | 2367 |
36
- | test | 9.87 | 28.27 | 2241 |
37
 
 
38
 
39
- HOME-Alcar:
40
 
41
- | set | CER (%) | WER (%) | support |
42
- | ----- | ---------- | --------- | --------- |
43
- | train | 4.74 | 17.29 | 59969 |
44
- | val | 7.82 | 23.67 | 7905 |
45
- | test | 8.34 | 24.57 | 6932 |
46
 
47
  ## How to use
48
 
 
14
  - Teklia/Alcar
15
  ---
16
 
17
+ # HOME-Alcar handwritten text recognition
18
 
19
+ This model performs Handwritten Text Recognition in Latin on medieval documents.
20
 
21
  ## Model description
22
 
23
+ The model was trained using the PyLaia library on two medieval datasets:
24
+ * [Himanis](https://demo.arkindex.org/browse/5000e248-a624-4df1-8679-1b34679817ef?top_level=true&folder=true) (French)
25
+ * [HOME Alcar](https://demo.arkindex.org/browse/46b9b1f4-baeb-4342-a501-e2f15472a276?top_level=true&folder=true) (Latin)
26
 
27
+ For training, text-lines were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
 
 
28
 
29
+ An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the HOME Alcar training set.
 
 
 
 
 
 
30
 
31
+ ## Evaluation results
32
 
33
+ The model achieves the following results:
34
 
35
+ | set | Language model | CER (%) | WER (%) | N lines |
36
+ |:------|:---------------|:----------:|:-------:|----------:|
37
+ | test | no | 8.35 | 26.15 | 6932 |
38
+ | test | yes | 7.85 | 23.20 | 6932 |
 
39
 
40
  ## How to use
41