KaraKaraWitch
commited on
Commit
•
3162216
1
Parent(s):
80bdfa1
Update README.md
Browse files
README.md
CHANGED
@@ -55,4 +55,26 @@ base_model: Qwen/Qwen2.5-72B
|
|
55 |
parameters:
|
56 |
normalize: true
|
57 |
dtype: bfloat16
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
58 |
```
|
|
|
55 |
parameters:
|
56 |
normalize: true
|
57 |
dtype: bfloat16
|
58 |
+
```
|
59 |
+
|
60 |
+
### Prompt Format
|
61 |
+
|
62 |
+
ChatML works for the most part.
|
63 |
+
|
64 |
+
### Sampler Settings
|
65 |
+
|
66 |
+
Personally I use the following:
|
67 |
+
|
68 |
+
```
|
69 |
+
Temp: 1.2
|
70 |
+
Min P: 0.07
|
71 |
+
Rep Pen: 1.1
|
72 |
+
```
|
73 |
+
|
74 |
+
Others have suggested the following:
|
75 |
+
|
76 |
+
```
|
77 |
+
Temp: 1.1
|
78 |
+
Top P: 0.98
|
79 |
+
Min P: 0.05
|
80 |
```
|