newmodel
- README.md +14 -22
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -20,12 +20,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [projecte-aina/roberta-base-ca-v2-cased-te](https://huggingface.co/projecte-aina/roberta-base-ca-v2-cased-te) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
-- Precision: 0.
-- Recall: 0.
-- F1: 0.
-- Ratio: 0.
+- Loss: 0.5939
+- Accuracy: 0.8445
+- Precision: 0.8451
+- Recall: 0.8445
+- F1: 0.8445
+- Ratio: 0.4790
 
 ## Model description
 
@@ -54,28 +54,20 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.06
 - lr_scheduler_warmup_steps: 4
-- num_epochs:
+- num_epochs: 2
 - label_smoothing_factor: 0.1
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 | Ratio |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:------:|
-|
-|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.67 | 2.0779 | 80 | 0.6208 | 0.8025 | 0.8043 | 0.8025 | 0.8022 | 0.5378 |
-| 0.5994 | 2.3377 | 90 | 0.6106 | 0.8235 | 0.8236 | 0.8235 | 0.8235 | 0.4916 |
-| 0.666 | 2.5974 | 100 | 0.5912 | 0.8361 | 0.8363 | 0.8361 | 0.8361 | 0.5126 |
-| 0.6142 | 2.8571 | 110 | 0.5853 | 0.8319 | 0.8320 | 0.8319 | 0.8319 | 0.5084 |
-| 0.6181 | 3.1169 | 120 | 0.5866 | 0.8361 | 0.8373 | 0.8361 | 0.8360 | 0.5294 |
-| 0.5555 | 3.3766 | 130 | 0.5762 | 0.8487 | 0.8496 | 0.8487 | 0.8486 | 0.4748 |
-| 0.5658 | 3.6364 | 140 | 0.5751 | 0.8487 | 0.8496 | 0.8487 | 0.8486 | 0.4748 |
-| 0.5777 | 3.8961 | 150 | 0.5736 | 0.8487 | 0.8491 | 0.8487 | 0.8487 | 0.4832 |
+| 0.5731 | 0.2597 | 10 | 0.5646 | 0.8613 | 0.8645 | 0.8613 | 0.8610 | 0.4538 |
+| 0.5515 | 0.5195 | 20 | 0.5569 | 0.8613 | 0.8626 | 0.8613 | 0.8612 | 0.4706 |
+| 0.5164 | 0.7792 | 30 | 0.6079 | 0.8277 | 0.8363 | 0.8277 | 0.8266 | 0.5798 |
+| 0.5641 | 1.0390 | 40 | 0.5728 | 0.8571 | 0.8608 | 0.8571 | 0.8568 | 0.4496 |
+| 0.4665 | 1.2987 | 50 | 0.5992 | 0.8403 | 0.8407 | 0.8403 | 0.8403 | 0.5168 |
+| 0.4632 | 1.5584 | 60 | 0.5990 | 0.8613 | 0.8634 | 0.8613 | 0.8611 | 0.4622 |
+| 0.4456 | 1.8182 | 70 | 0.5939 | 0.8445 | 0.8451 | 0.8445 | 0.8445 | 0.4790 |
 
 
 ### Framework versions
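Reviewer note: the Epoch and Step columns in the added rows can be cross-checked against `num_epochs: 2` with a little arithmetic. The sketch below is not part of the commit; the row values are copied from the added table rows, and the helper names (`steps_per_epoch`, `lowest_val_loss`) are hypothetical, introduced only for illustration.

```python
# Rows copied from the added lines of the "Training results" table:
# (training_loss, epoch, step, validation_loss, accuracy)
ROWS = [
    (0.5731, 0.2597, 10, 0.5646, 0.8613),
    (0.5515, 0.5195, 20, 0.5569, 0.8613),
    (0.5164, 0.7792, 30, 0.6079, 0.8277),
    (0.5641, 1.0390, 40, 0.5728, 0.8571),
    (0.4665, 1.2987, 50, 0.5992, 0.8403),
    (0.4632, 1.5584, 60, 0.5990, 0.8613),
    (0.4456, 1.8182, 70, 0.5939, 0.8445),
]
NUM_EPOCHS = 2  # from the "+- num_epochs: 2" line of the diff


def steps_per_epoch(rows):
    """Average step/epoch ratio across the logged rows (hypothetical helper)."""
    ratios = [step / epoch for _, epoch, step, _, _ in rows]
    return sum(ratios) / len(ratios)


def lowest_val_loss(rows):
    """Return the logged row with the smallest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


spe = steps_per_epoch(ROWS)
estimated_total_steps = spe * NUM_EPOCHS
best = lowest_val_loss(ROWS)
last = ROWS[-1]

print(f"steps per epoch ~ {spe:.2f}")
print(f"estimated total steps ~ {estimated_total_steps:.1f}")
print(f"lowest validation loss at step {best[2]}: {best[3]}")
print(f"last logged row is step {last[2]}: loss {last[3]}, accuracy {last[4]}")
```

On these rows the ratio comes out at about 38.5 steps per epoch, so roughly 77 steps for 2 epochs, assuming the ratio stays constant. That is consistent with the table ending at step 70 if evaluation ran every 10 steps, which is an inference from the Step column rather than something the diff states. The headline metrics in the README (Loss 0.5939, Accuracy 0.8445) match the step-70 row, not the row with the lowest validation loss (step 20).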
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:3545c58781ed9a29d6be3a2147315694b1c31e3cd2b828798035637fdfed3c1d
 size 498606684
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:3e28d2ed42f790e45fafd5fd66ee342fe0eedb2e9b490de8dc363d842039ef22
 size 4920