Knightcodin
/

Llama-3-8b-64k-PoSE-exl2

Model card Files Files and versions Community

Knightcodin commited on Apr 26, 2024

Commit

63f5fdb

·

verified ·

1 Parent(s): 2c7cd05

Update README.md

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -8,6 +8,13 @@ quantized_by: KnightCodin
 ---
 ## Exllama v2 Quantizations of winglian/Llama-3-8b-64k-PoSE
 pipeline_tag: text-generation
 tags:
 - facebook

 ---
 ## Exllama v2 Quantizations of winglian/Llama-3-8b-64k-PoSE
+Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.19">turboderp's ExLlamaV2 v0.0.19</a> for quantization.
+<b>The "main" branch only contains the measurement.json, download one of the other branches for the model (see below)</b>
+Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
 pipeline_tag: text-generation
 tags:
 - facebook