nicholasKluge/RewardModel

Text Classification

preference model

nicholasKluge committed on Jun 13, 2023

Commit 999e7d9 · 1 Parent(s): 6329d88

Update README.md

Files changed (1)

README.md +2 -2


@@ -25,7 +25,7 @@ The `RewardModel` is a [BERT](https://huggingface.co/bert-base-cased)model that
 The model was trained with a dataset composed of `prompt`, `prefered_completions`, and `rejected_completions`.
-> Note: These prompt + completions are samples of intruction datasets created via the [Self-Instruct](https://github.com/yizhongw/self-instruct) framework.
+These prompt + completions are samples of intruction datasets created via the [Self-Instruct](https://github.com/yizhongw/self-instruct) framework.
 ## Details
@@ -122,7 +122,7 @@ This will output the following:
 |---|---|
 | [Aira-RewardModel](https://huggingface.co/nicholasKluge/RewardModel)  | 96.54%*  |
-* Only considering comparisons of the `webgpt_comparisons` dataset that had a preferred option.
+* *Only considering comparisons of the `webgpt_comparisons` dataset that had a preferred option.
 ## License