nicholasKluge
/

RewardModel

Text Classification

preference model

Inference Endpoints

Model card Files Files and versions Community

nicholasKluge commited on Jun 7, 2023

Commit

981b0ee

·

1 Parent(s): e584ad0

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -44,6 +44,8 @@ The model was trained with a dataset composed of `prompt`, `completions`, and an
 | 4 |0.024755|0.02109|
 | 5 |0.019445|0.01416|
 ## Usage
 Here's an example of how to use the `RewardModel` to score the quality of a response to a given prompt:

 | 4 |0.024755|0.02109|
 | 5 |0.019445|0.01416|
+> Note: This repository has the notebook used to train this model.
 ## Usage
 Here's an example of how to use the `RewardModel` to score the quality of a response to a given prompt: