Model save

Files changed (4)

README.md CHANGED

@@ -27,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/yyr/huggingface/runs/mqg6o7d2)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/yyr/huggingface/runs/jliwl3e2)
 This model was trained with DPO, a method introduced in [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://huggingface.co/papers/2305.18290).

last_checkpoint/model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dcf0cad79011305616ed7dfc527129c71d5153ea77de17b0fe8cb95cc1d38756
+oid sha256:09f9dfac1a1368b1e2c1c9a921d36fbc5a848dd15d54d13f1a74c52669716615
 size 4965805240

last_checkpoint/model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1339c05a7564ae0e07bbd59ada290f1880092cf5812ae8a08aa558e614e78733
+oid sha256:b653a334988c5cd39dc416bec220c0ba8a06bc116fa22ca18d18b9f75857c28a
 size 2247741136

last_checkpoint/training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5fe52c7646e9303442e0db1e859a82b84b9ec1655b043fdd45c7cbf938639752
+oid sha256:4b9f4a147abc03fca79ab9617d9fc437957c7003d43e67ebb80f94d43a3d3cb2
 size 7608