|
--- |
|
license: mit |
|
metrics: |
|
- cer |
|
library_name: transformers |
|
tags: |
|
- medieval |
|
- ocr |
|
- htr |
|
language: |
|
- de |
|
- fr |
|
- la |
|
- nl |
|
--- |
|
# TrOCR Medieval Model with linemasks generated in eScriptorium (https://de.wikipedia.org/wiki/EScriptorium) |
|
Base model: **microsoft/trocr-base-handwritten** |
|
|
|
Epochs: 19.05 / 20 |
|
Eval CER: 0.0329 |
|
|
|
This is a combined model of ground truth of different **charter** and **book scripts** from a variety of projects and institutions, aiming at building a generic model for Latin scripts of the Middle Ages. |
|
It is mainly based on documents from the project CREMMA Manuscrits médiévaux latins, HIMANIS (CNRS), Itinera Nova (Stadsarchief Leuven), and Charters and Records of Königsfelden (Universität Zürich). |
|
|
|
Based on the following data: |
|
CREMMA Manuscrits médiévaux latins has been produced by Clérice, Thibault and Chagué, Alix and Vlachou Efstathiou, Malamatenia. It is licensed under a CC-BY 4.0 license. |
|
URL: https://github.com/HTR-United/CREMMA-Medieval-LAT |
|
|
|
HIMANIS is partially published as HIMANIS Guérin produced by Stutzmann, Dominique; Hamel, Sébastien; Kernier, Iseut de; Mühlberger, Günter; Hackl, Günter. Licensed under a CC-BY 4.0 license. |
|
DOI: 10.5281/zenodo.5535306 |
|
|
|
Charters and Records of Königsfelden Abbey and Bailiwick (1308-1662) has been produced by Halter-Pernet, Colette; Teuscher, Simon; Hodel, Tobias; Barwitzki, Lukas; Egloff, Salome; Henggeler, Fabian; Nadig, Michael; Steinmann, Anina; Stettler, Sabine; Prada Ziegler, Ismail. Licensed under a CC-BY 4.0 license. |
|
DOI: 10.5281/zenodo.5179361 |
|
|
|
The model is based on the same data as the following PyLaia model (available on Transkribus): |
|
https://readcoop.eu/model/charter-scripts-german-latin-french/ |
|
|
|
The model has not been extensively tested. |
|
Potential biases are still to be identified. |