Shotaro30678/response_generator_DPO

Text Generation

4-bit precision

Shotaro30678 committed on Aug 26

Commit 77baa3a • 1 Parent(s): 2910e15

Update README.md

Files changed (1)

README.md +3 -3

@@ -33,7 +33,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
 **[Shotaro30678/emotion_text_classifier_on_dd_v1](https://huggingface.co/Shotaro30678/emotion_text_classifier_on_dd_v1)**
 | **Metric**   | **DPO Trained Model** | **SFT Model (Reference)** |
-|--------------|-----------------------|---------------------------|
+|--------------|:----------------------:|:--------------------------:|
 | **Accuracy** | 0.851                 | 0.788                     |
 | **F1-score** | 0.8564                | 0.7975                    |
@@ -41,7 +41,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
 **[madhurjindal/autonlp-Gibberish-Detector-492513457](https://huggingface.co/madhurjindal/autonlp-Gibberish-Detector-492513457)**
 | **Category**        | **DPO Trained Model** | **SFT Model (Reference)** |
-|---------------------|-----------------------|---------------------------|
+|---------------------|:----------------------:|:--------------------------:|
 | **Clean**           | 882                   | 898                       |
 | **Mild Gibberish**  | 94                    | 58                        |
 | **Word Salad**      | 21                    | 33                        |
@@ -50,7 +50,7 @@ Use dpo trainer to do the RLHF so that the model can be more precise and consist
 **Cut-Off Output:**
 | **Output Type**     | **DPO Trained Model** | **SFT Model (Reference)** |
-|---------------------|-----------------------|---------------------------|
+|---------------------|:----------------------:|:--------------------------:|
 | **Complete Output** | 985                   | 975                       |
 | **Incomplete Output** | 15                  | 25                        |