kartikmosaicml
committed on
Commit 2d1dde9 • 1 Parent(s): 1f38a81
Update README.md
README.md CHANGED
@@ -176,15 +176,15 @@ The model was trained on the following data mix:
 
 | Data Source | Number of Tokens in Source | Proportion |
 |-------------|----------------------------|------------|
-| competition_math | 1.6 M | 3.
-| cot_gsm8k | 3.36 M |
-| dialogsum | 0.1 M | 0.
-| dolly_hhrlhf | 5.89 M |
-| duorc | 8
-| qasper |
-| quality | 11.
-| scrolls/summ_screen_fd |
-| spider | 0.089 M | 0.
+| competition_math | 1.6 M | 3.66% |
+| cot_gsm8k | 3.36 M | 7.67% |
+| dialogsum | 0.1 M | 0.23% |
+| dolly_hhrlhf | 5.89 M | 13.43% |
+| duorc | 7.8 M | 17.80% |
+| qasper | 8.72 M | 19.90% |
+| quality | 11.29 M | 25.78% |
+| scrolls/summ_screen_fd | 4.97 M | 11.33% |
+| spider | 0.089 M | 0.20% |
 
 ## PreTraining Data
 
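A quick way to sanity-check the updated Proportion column is to recompute each share from the token counts. The sketch below assumes that a source's proportion is its token count divided by the total over the nine listed sources; the README does not state the formula, so treat that as an assumption. The names `tokens_m`, `listed_pct`, `recomputed`, and `max_gap` are illustrative only. Because the token counts in the table are rounded, the recomputed values are expected to agree with the listed ones only approximately.

```python
# Token counts in millions, copied from the updated table.
tokens_m = {
    "competition_math": 1.6,
    "cot_gsm8k": 3.36,
    "dialogsum": 0.1,
    "dolly_hhrlhf": 5.89,
    "duorc": 7.8,
    "qasper": 8.72,
    "quality": 11.29,
    "scrolls/summ_screen_fd": 4.97,
    "spider": 0.089,
}

# Proportions (percent) as listed in the updated table.
listed_pct = {
    "competition_math": 3.66,
    "cot_gsm8k": 7.67,
    "dialogsum": 0.23,
    "dolly_hhrlhf": 13.43,
    "duorc": 17.80,
    "qasper": 19.90,
    "quality": 25.78,
    "scrolls/summ_screen_fd": 11.33,
    "spider": 0.20,
}

# Assumption: proportion = source tokens / total tokens across listed sources.
total = sum(tokens_m.values())
recomputed = {name: 100 * count / total for name, count in tokens_m.items()}

# Largest absolute difference (percentage points) between listed and recomputed.
max_gap = max(abs(recomputed[name] - listed_pct[name]) for name in tokens_m)

for name in tokens_m:
    print(f"{name:24s} listed {listed_pct[name]:6.2f}%  recomputed {recomputed[name]:6.2f}%")
print(f"total tokens: {total:.3f} M, largest gap: {max_gap:.3f} percentage points")
```

Under that assumption the listed percentages sum to 100% and each one sits within a few hundredths of a percentage point of the recomputed share, which is consistent with the proportions having been derived from unrounded token counts.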