File size: 559 Bytes
1b1c88b
 
91476d0
 
 
 
 
 
 
 
 
 
1b1c88b
 
70cf846
1b1c88b
2ca3df4
1b1c88b
91476d0
1b1c88b
91476d0
 
1b1c88b
0fccabc
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
library_name: transformers
tags:
- legal
license: apache-2.0
datasets:
- byczong/pl-insurance-terms-struct
language:
- pl
base_model:
- naver-clova-ix/donut-base
pipeline_tag: image-text-to-text
---

# Model Card

Donut fine-tuned for full document structuring (parsing) on [pl-insurance-terms-struct](https://huggingface.co/datasets/byczong/pl-insurance-terms-struct) dataset.

Trained for 10 epochs with `max_seq_len=7168`.

- Field-level f1 score: 0.57
- TED-based accuracy: 0.67


Note: This model and its tokenizer were not (pre-) trained for Polish.