Locutusque committed on
Commit bdca3ed
1 parent: 2bec1f5

Create README.md

Files changed (1)
  1. README.md +22 -0
README.md ADDED
@@ -0,0 +1,22 @@
+ ---
+ datasets:
+ - M4-ai/Rhino
+ language:
+ - en
+ ---
+ # Model description
+ The model was trained on the full M4-ai/Rhino dataset (approximately 200,000 examples, none omitted). Eight checkpoints were saved during training; the second, reached after approximately 50,000 examples, performed best.
+
+ This model has not been aligned with DPO. Separate repositories containing DPO-aligned versions of this model, each using a different dataset, will be released in the future.
+
+ # Evaluation
+ In my own informal testing, the model shows excellent performance on mathematics, history, and coding tasks.
+
+ However, because the model requires `trust_remote_code=True`, it cannot be submitted to the Open LLM Leaderboard. I will therefore evaluate its capabilities myself.
+
+ # Recommended prompt format
+ ```
+ <|im_start|>system message <|im_end|> <|im_start|>user message <|im_end|> <|im_start|>assistant message <|im_end|>
+ ```
+
+ I recommend passing `eos_token="<|im_end|>"` when initializing the tokenizer.
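
The prompt format and tokenizer recommendation in this README can be illustrated with a minimal Python sketch. It is not part of the commit and rests on several assumptions: that each `message` in the template is a placeholder for the turn's text, that the role and the text are separated by a single space exactly as the template shows, and that leaving the final assistant turn open is how a reply is requested. The helper names `build_prompt` and `load_model_and_tokenizer` and the `repo_id` parameter are hypothetical; the commit does not state the model's repository id.

```python
from typing import Dict, List

IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_prompt(messages: List[Dict[str, str]], add_generation_prompt: bool = True) -> str:
    """Render chat turns in the single-line format shown in the README.

    Assumption: "message" in the template is a placeholder for the turn text.
    """
    turns = [f"{IM_START}{m['role']} {m['content']} {IM_END}" for m in messages]
    if add_generation_prompt:
        # Assumption: an open assistant turn asks the model to write the reply.
        turns.append(f"{IM_START}assistant")
    return " ".join(turns)


def load_model_and_tokenizer(repo_id: str):
    """Load the model with the two settings the README mentions.

    `repo_id` is a placeholder for the model's Hub repository id.
    """
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(
        repo_id,
        eos_token=IM_END,        # recommended in the README
        trust_remote_code=True,  # required by the model, per the README
    )
    model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
    return model, tokenizer


if __name__ == "__main__":
    example = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is 2 + 2?"},
    ]
    print(build_prompt(example))
```

Setting `eos_token` to `<|im_end|>` presumably lets generation stop at the end of the assistant turn rather than running on into a new one. Note that `trust_remote_code=True` executes Python code shipped in the model repository, so it is worth reviewing that code before loading.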