byczong's picture
Update README.md
0fccabc verified
---
library_name: transformers
tags:
- legal
license: apache-2.0
datasets:
- byczong/pl-insurance-terms-struct
language:
- pl
base_model:
- naver-clova-ix/donut-base
pipeline_tag: image-text-to-text
---
# Model Card
Donut fine-tuned for full document structuring (parsing) on [pl-insurance-terms-struct](https://huggingface.co/datasets/byczong/pl-insurance-terms-struct) dataset.
Trained for 10 epochs with `max_seq_len=7168`.
- Field-level f1 score: 0.57
- TED-based accuracy: 0.67
Note: This model and its tokenizer were not (pre-) trained for Polish.