miulab/llama2-7b-alpaca-sft-10k

Text Generation


hank0316 committed on Oct 3

Commit fa9d67e • 1 Parent(s): c9bbb63

Update README.md

Files changed (1)

README.md +2 -0


@@ -11,6 +11,8 @@ pipeline_tag: text-generation
 This is the backbone SFT model used in the paper "[DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging](https://arxiv.org/abs/2407.01470)".
+The detailed training/evaluation information can be found at https://api.wandb.ai/links/merge_exp/2qs92v6f.
 For the detailed information about this model, please refer to our paper.
 If you found this model useful, please cite our paper: