metadata
library_name: transformers
tags:
- legal
license: apache-2.0
datasets:
- byczong/pl-insurance-terms-struct
language:
- pl
base_model:
- naver-clova-ix/donut-base
pipeline_tag: image-text-to-text
Model Card
Donut fine-tuned for full document structuring (parsing) on pl-insurance-terms-struct dataset.
Trained for 10 epochs with max_seq_len=7168
.
- Field-level f1 score: 0.57
- TED-based accuracy: 0.67
Note: This model and its tokenizer were not (pre-) trained for Polish.