Shotaro30678
commited on
Commit
•
77baa3a
1
Parent(s):
2910e15
Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
|
|
33 |
**[Shotaro30678/emotion_text_classifier_on_dd_v1](https://huggingface.co/Shotaro30678/emotion_text_classifier_on_dd_v1)**
|
34 |
|
35 |
| **Metric** | **DPO Trained Model** | **SFT Model (Reference)** |
|
36 |
-
|
37 |
| **Accuracy** | 0.851 | 0.788 |
|
38 |
| **F1-score** | 0.8564 | 0.7975 |
|
39 |
|
@@ -41,7 +41,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
|
|
41 |
**[madhurjindal/autonlp-Gibberish-Detector-492513457](https://huggingface.co/madhurjindal/autonlp-Gibberish-Detector-492513457)**
|
42 |
|
43 |
| **Category** | **DPO Trained Model** | **SFT Model (Reference)** |
|
44 |
-
|
45 |
| **Clean** | 882 | 898 |
|
46 |
| **Mild Gibberish** | 94 | 58 |
|
47 |
| **Word Salad** | 21 | 33 |
|
@@ -50,7 +50,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
|
|
50 |
**Cut-Off Output:**
|
51 |
|
52 |
| **Output Type** | **DPO Trained Model** | **SFT Model (Reference)** |
|
53 |
-
|
54 |
| **Complete Output** | 985 | 975 |
|
55 |
| **Incomplete Output** | 15 | 25 |
|
56 |
|
|
|
33 |
**[Shotaro30678/emotion_text_classifier_on_dd_v1](https://huggingface.co/Shotaro30678/emotion_text_classifier_on_dd_v1)**
|
34 |
|
35 |
| **Metric** | **DPO Trained Model** | **SFT Model (Reference)** |
|
36 |
+
|--------------|:----------------------:|:--------------------------:|
|
37 |
| **Accuracy** | 0.851 | 0.788 |
|
38 |
| **F1-score** | 0.8564 | 0.7975 |
|
39 |
|
|
|
41 |
**[madhurjindal/autonlp-Gibberish-Detector-492513457](https://huggingface.co/madhurjindal/autonlp-Gibberish-Detector-492513457)**
|
42 |
|
43 |
| **Category** | **DPO Trained Model** | **SFT Model (Reference)** |
|
44 |
+
|---------------------|:----------------------:|:--------------------------:|
|
45 |
| **Clean** | 882 | 898 |
|
46 |
| **Mild Gibberish** | 94 | 58 |
|
47 |
| **Word Salad** | 21 | 33 |
|
|
|
50 |
**Cut-Off Output:**
|
51 |
|
52 |
| **Output Type** | **DPO Trained Model** | **SFT Model (Reference)** |
|
53 |
+
|---------------------|:----------------------:|:--------------------------:|
|
54 |
| **Complete Output** | 985 | 975 |
|
55 |
| **Incomplete Output** | 15 | 25 |
|
56 |
|