Commit 7fbc269
Author: chrisliu298
Parent(s): f94f78b

Update README.md

README.md CHANGED
@@ -27,7 +27,7 @@ pipeline_tag: text-classification
 
 We include only public data in an attempt to demonstrate that high-performance reward models can be achieved with a relatively small dataset and straightforward data curation techniques, without further algorithmic or architectural modifications. The sources of data used in the [Skywork Reward Data Collection](https://huggingface.co/collections/Skywork/skywork-reward-data-collection-66d7fda6a5098dc77035336d) are detailed in the [Data Mixture](#data-mixture) section below.
 
-The resulting reward models excel at handling preferences in complex scenarios, including challenging preference pairs, and span various domains such as mathematics, coding, and safety.
+The resulting reward models excel at handling preferences in complex scenarios, including challenging preference pairs, and span various domains such as mathematics, coding, and safety.
 
 ## Data Mixture
 