Locutusque
commited on
Commit
•
bdca3ed
1
Parent(s):
2bec1f5
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,22 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
datasets:
|
3 |
+
- M4-ai/Rhino
|
4 |
+
language:
|
5 |
+
- en
|
6 |
+
---
|
7 |
+
# Model description
|
8 |
+
The model was trained on approximately 200,000 examples from the M4-ai/Rhino dataset, with no examples omitted. It underwent eight checkpoints during training, with the second checkpoint identified as the best-performing one (after processing approximately 50,000 examples).
|
9 |
+
|
10 |
+
This model has not been aligned with DPO. In the future, different repositories will be released that contain versions of this model aligned with DPO, using various datasets.
|
11 |
+
|
12 |
+
# Evaluation
|
13 |
+
Upon personal testing, the model demonstrates excellent performance in mathematics, history, and coding tasks.
|
14 |
+
|
15 |
+
However, due to the model's requirement for the setting trust_remote_code=True, it cannot be submitted to the Open LLM Leaderboard. As a result, I will conduct my own evaluation of its capabilities.
|
16 |
+
|
17 |
+
# Recommended prompt format
|
18 |
+
```
|
19 |
+
<|im_start|>system message <|im_end|> <|im_start|>user message <|im_end|> <|im_start|>assistant message <|im_end|>
|
20 |
+
```
|
21 |
+
|
22 |
+
I recommend passing eos_token="<|im_end|>" when initializing the tokenizer.
|