Vui Seng Chua commited on
Commit
edd9ad0
1 Parent(s): 5062f79

Rename README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ A set of 50% weight-sparse Llama3.1-8B pruned by [Wanda](https://github.com/locuslab/wanda).
3
+ Model links are in the table below. Models can be loaded as is with Huggingface Transformers.
4
+
5
+ ### Perplexity
6
+ ![Perplexity over Sparsity](llama3.1-8B_Wanda_sparsity.png)
7
+ ### MMLU (5-shot)
8
+ | MMLU (5-shot) | Accuracy (%) | Relative to Dense (%) | Model Link|
9
+ |----------------|--------------|-----------------------|-----------|
10
+ | Dense | 65.1 | baseline | [Meta-Llama-3.1-8B-wanda-unstructured-0.0](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-unstructured-0.0) |
11
+ | Unstructured | 50.0 | -15.1 | [Meta-Llama-3.1-8B-wanda-unstructured-0.5](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-unstructured-0.5) |
12
+ | 4:8 | 39.3 | -25.8 | [Meta-Llama-3.1-8B-wanda-4of8](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-4of8) |
13
+ | 2:4 | 28.7 | -36.4 | [Meta-Llama-3.1-8B-wanda-2of4](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-2of4) |
14
+