metadata
library_name: transformers
tags:
  - legal
license: apache-2.0
datasets:
  - byczong/pl-insurance-terms-struct
language:
  - pl
base_model:
  - naver-clova-ix/donut-base
pipeline_tag: image-text-to-text

Model Card

Donut fine-tuned for full document structuring (parsing) on the pl-insurance-terms-struct dataset.

Trained for 10 epochs with max_seq_len=7168.
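A minimal inference sketch, assuming the checkpoint loads through the standard Donut classes in `transformers` (`DonutProcessor`, `VisionEncoderDecoderModel`). The function name `parse_document` and the arguments `model_id` and `task_prompt` are hypothetical: this card does not state the repository id or the task start token used during fine-tuning, so both must be supplied by the caller. The default `max_length=7168` mirrors the training sequence length above.

```python
import re


def parse_document(image_path, model_id, task_prompt, max_length=7168):
    """Run a Donut-style checkpoint on one page image and return parsed JSON.

    model_id and task_prompt are placeholders supplied by the caller;
    neither value is documented in this model card.
    """
    # Heavy dependencies are imported lazily so the module can be
    # imported without torch/transformers installed.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values

    # The decoder is primed with the task start token (assumed, see above).
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        output_ids = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=max_length,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            use_cache=True,
        )

    sequence = processor.batch_decode(output_ids)[0]
    sequence = sequence.replace(processor.tokenizer.eos_token, "")
    sequence = sequence.replace(processor.tokenizer.pad_token, "")
    # Drop the leading task start token before converting to JSON.
    sequence = re.sub(r"<.*?>", "", sequence, count=1).strip()
    return processor.token2json(sequence)
```

Generating up to 7168 tokens per page can be slow on CPU; a smaller `max_length` may be enough for short documents, at the risk of truncated output.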

  • Field-level F1 score: 0.57
  • TED-based accuracy: 0.67
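The card does not say how these metrics were computed. As an illustration only, the sketch below shows one common way to compute a field-level F1 score for structured outputs, assuming a Donut-style definition: each prediction and reference is flattened into (key path, value) pairs, and exact matches count as true positives. The helper names `flatten_fields` and `field_f1` are hypothetical, and the actual evaluation behind the numbers above may differ.

```python
from collections import Counter


def flatten_fields(node, path=()):
    """Yield (key_path, value) pairs for every leaf of a nested dict/list."""
    if isinstance(node, dict):
        for key, child in node.items():
            yield from flatten_fields(child, path + (key,))
    elif isinstance(node, list):
        # List positions are ignored, so item order does not affect matching.
        for child in node:
            yield from flatten_fields(child, path)
    else:
        yield (path, str(node).strip())


def field_f1(predictions, references):
    """Micro-averaged F1 over exactly matching (key_path, value) fields."""
    tp = fp = fn = 0
    for pred, ref in zip(predictions, references):
        pred_fields = Counter(flatten_fields(pred))
        ref_fields = Counter(flatten_fields(ref))
        # Multiset intersection counts each matching field once per occurrence.
        matched = sum((pred_fields & ref_fields).values())
        tp += matched
        fp += sum(pred_fields.values()) - matched
        fn += sum(ref_fields.values()) - matched
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0
```

For example, a prediction that gets two of three fields right, with one wrong value, scores 2/3: two true positives, one false positive, and one false negative.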

Note: This model and its tokenizer were not (pre-)trained for Polish.