Image-to-Text
Transformers
PyTorch
TensorBoard
Russian
English
vision-encoder-decoder
image-text-to-text
ocr
Inference Endpoints
trocr-base-ru / README.md
sherstpasha's picture
Update README.md
02498f0 verified
|
raw
history blame
1.07 kB
---
license: apache-2.0
datasets:
- nastyboget/stackmix_hkr_large
- nastyboget/stackmix_cyrillic_large
- nastyboget/synthetic_cyrillic_large
language:
- ru
- en
pipeline_tag: image-to-text
widget:
- src: examples/1.png
- src: examples/2.png
- src: examples/3.png
- src: examples/4.png
tags:
- ocr
---
# Model Card for TrOCR-Ru
<!-- Provide a quick summary of what the model is/does. -->
Finetuned model [microsoft/trocr-base-handwritten](https://huggingface.co/microsoft/trocr-base-handwritten) on large synth datasets from [nastyboget](https://huggingface.co/nastyboget).
Forked from https://huggingface.co/raxtemur/trocr-base-ru
## Metrics on HKR/Cyrillic datasets
| Metric | HKR_val | HKR_test1 | HKR_test2 | CYR_val | CYR_test |
|:--------:|:---------:|:---------:|:---------:|:---------:|:---------:|
| Accuracy | 69.9947 | 67.4184 | 69.9187 | 72.3613 | 63.9249 |
| CER | 6.7964 | 8.9113 | 6.7278 | 6.6403 | 9.2576 |
| WER | 21.6688 | 27.3849 | 21.6200 | 27.6715 | 33.2406 |
Last update form 29/02/2024