miulab/llama2-7b-alpaca-sft-10k

Text Generation

hank0316 committed on Oct 3

Commit c9bbb63 • 1 Parent(s): c3f5ce7

Update README.md

Files changed (1)

README.md +3 -0

@@ -4,6 +4,9 @@ language:
 - en
 base_model:
 - meta-llama/Llama-2-7b-hf
+datasets:
+- tatsu-lab/alpaca_farm
+pipeline_tag: text-generation
 ---
 This is the backbone SFT model used in the paper "[DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging](https://arxiv.org/abs/2407.01470)".
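
Because this commit sets `pipeline_tag: text-generation`, the model can presumably be loaded through the `transformers` text-generation pipeline. The following is a minimal sketch, not something the README itself provides. It assumes `transformers` and a backend such as PyTorch are installed, and that the repository ID is `miulab/llama2-7b-alpaca-sft-10k` as shown in the page header. The helper names `build_generator` and `generate` are hypothetical, and since the README does not specify a prompt template, the prompt is passed through unchanged.

```python
# Repository ID taken from the page header (assumed to be the Hub model ID).
MODEL_ID = "miulab/llama2-7b-alpaca-sft-10k"


def build_generator(model_id: str = MODEL_ID):
    """Create a text-generation pipeline for the given model (hypothetical helper)."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # Downloads the checkpoint from the Hub on first use.
    return pipeline("text-generation", model=model_id)


def generate(generator, prompt: str, max_new_tokens: int = 64) -> str:
    """Run one prompt through a text-generation callable and return the text."""
    # The pipeline returns a list of dicts, each with a "generated_text" key.
    outputs = generator(prompt, max_new_tokens=max_new_tokens)
    return outputs[0]["generated_text"]
```

A call such as `generate(build_generator(), "Explain model merging in one sentence.")` would download the full 7B-parameter checkpoint on first use, so it is best run on a machine with enough memory and disk space.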