hank0316 commited on
Commit
c9bbb63
1 Parent(s): c3f5ce7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -4,6 +4,9 @@ language:
4
  - en
5
  base_model:
6
  - meta-llama/Llama-2-7b-hf
 
 
 
7
  ---
8
 
9
  This is the backbone SFT model used in the paper "[DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging](https://arxiv.org/abs/2407.01470)".
 
4
  - en
5
  base_model:
6
  - meta-llama/Llama-2-7b-hf
7
+ datasets:
8
+ - tatsu-lab/alpaca_farm
9
+ pipeline_tag: text-generation
10
  ---
11
 
12
  This is the backbone SFT model used in the paper "[DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging](https://arxiv.org/abs/2407.01470)".