qgallouedec HF staff commited on
Commit
0849991
1 Parent(s): 40676ce

End of training

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -10,6 +10,6 @@ tags:
10
  licence: license
11
  ---
12
 
13
- # Model Card for Model name
14
 
15
  This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct) on the https://huggingface.co/datasets/trl-lib/ultrafeedback-prompt dataset.
 
10
  licence: license
11
  ---
12
 
13
+ # Model Card for online-dpo-qwen2-2
14
 
15
  This model is a fine-tuned version of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct) on the https://huggingface.co/datasets/trl-lib/ultrafeedback-prompt dataset.