Shotaro30678 committed on
Commit
77baa3a
1 Parent(s): 2910e15

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -33,7 +33,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
  **[Shotaro30678/emotion_text_classifier_on_dd_v1](https://huggingface.co/Shotaro30678/emotion_text_classifier_on_dd_v1)**
 
  | **Metric** | **DPO Trained Model** | **SFT Model (Reference)** |
- |--------------|-----------------------|---------------------------|
+ |--------------|:----------------------:|:--------------------------:|
  | **Accuracy** | 0.851 | 0.788 |
  | **F1-score** | 0.8564 | 0.7975 |
 
@@ -41,7 +41,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
  **[madhurjindal/autonlp-Gibberish-Detector-492513457](https://huggingface.co/madhurjindal/autonlp-Gibberish-Detector-492513457)**
 
  | **Category** | **DPO Trained Model** | **SFT Model (Reference)** |
- |---------------------|-----------------------|---------------------------|
+ |---------------------|:----------------------:|:--------------------------:|
  | **Clean** | 882 | 898 |
  | **Mild Gibberish** | 94 | 58 |
  | **Word Salad** | 21 | 33 |
@@ -50,7 +50,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
  **Cut-Off Output:**
 
  | **Output Type** | **DPO Trained Model** | **SFT Model (Reference)** |
- |---------------------|-----------------------|---------------------------|
+ |---------------------|:----------------------:|:--------------------------:|
  | **Complete Output** | 985 | 975 |
  | **Incomplete Output** | 15 | 25 |
 