Update README.md
README.md
CHANGED
@@ -20,6 +20,8 @@ quantized_by: bartowski
 
 ## Exllama v2 Quantizations of Phi-3.1-mini-4k-instruct
 
+<b>I'm calling this Phi-3.1 because Microsoft made the decision to release a huge update in place.. So yes, it's the new model from June 2nd 2024, but I've renamed it for clarity.</b>
+
 Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.1.6">turboderp's ExLlamaV2 v0.1.6</a> for quantization.
 
 <b>The "main" branch only contains the measurement.json, download one of the other branches for the model (see below)</b>