jinjieyuan committed
Commit 955de36
1 Parent(s): bcbec33

Create README.md

Files changed (1)
  1. README.md +27 -0
README.md ADDED
@@ -0,0 +1,27 @@
+ ---
+ language: en
+ license: apache-2.0
+ ---
+
+ # SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models
+
+ - Base Model: [IntelLabs/sqft-phi-3-mini-4k-50-base-gptq](https://huggingface.co/IntelLabs/sqft-phi-3-mini-4k-50-base-gptq)
+ - Sparsity: 50%
+ - Quantization: INT4 (GPTQ)
+ - Finetune Method: SQFT
+ - Finetune data: 10K instruction-following math reasoning training dataset from [LLM-Adapters](https://github.com/AGI-Edgerunners/LLM-Adapters) ([math_10k.json](https://github.com/AGI-Edgerunners/LLM-Adapters/blob/main/ft-training_set/math_10k.json))
+ - Sub-Adapter: Heuristic
+
+ ### Evaluation
+
+ ```bash
+ git clone https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning.git haaml && cd haaml/SQFT
+
+ BASE_MODEL_PATH=IntelLabs/sqft-phi-3-mini-4k-50-base-gptq
+ ADAPTER_MODEL_PATH=IntelLabs/sqft-phi-3-mini-4k-50-gptq-math-heu-adapter
+ OUTPUT_DIR=./results
+ python eval/evaluate_math.py --base_model_path ${BASE_MODEL_PATH} --adapter_model_path ${ADAPTER_MODEL_PATH} --output_dir ${OUTPUT_DIR}
+ ```
+
+ Refer to our [repo](https://github.com/IntelLabs/Hardware-Aware-Automated-Machine-Learning/tree/main/SQFT) for the environment information to run this command.
+
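
The evaluation command in the added README hard-codes its three paths as shell variables. As a minimal sketch (not part of the commit or of the SQFT repo), the same variables could take their README values as defaults while remaining overridable from the environment. The helper name `build_eval_cmd` is hypothetical, and it only prints the command rather than running it, so the exact invocation can be checked first. It uses only the script path and flags that appear in the README.

```bash
set -euo pipefail

# Defaults are the values from the README; each can be overridden from the environment.
BASE_MODEL_PATH="${BASE_MODEL_PATH:-IntelLabs/sqft-phi-3-mini-4k-50-base-gptq}"
ADAPTER_MODEL_PATH="${ADAPTER_MODEL_PATH:-IntelLabs/sqft-phi-3-mini-4k-50-gptq-math-heu-adapter}"
OUTPUT_DIR="${OUTPUT_DIR:-./results}"

# build_eval_cmd is a hypothetical helper (an assumption, not part of the SQFT repo).
# It prints the evaluation command so it can be inspected before being run.
build_eval_cmd() {
  printf 'python eval/evaluate_math.py --base_model_path %s --adapter_model_path %s --output_dir %s\n' \
    "${BASE_MODEL_PATH}" "${ADAPTER_MODEL_PATH}" "${OUTPUT_DIR}"
}

build_eval_cmd
```

Setting, for example, `OUTPUT_DIR=./results_run2` before running the snippet changes only the output directory in the printed command. The printed line is meant to be run from the `haaml/SQFT` directory, in the environment described in the repo.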