Text Generation · PEFT · Safetensors · falcon
dfurman committed
Commit 40ba213
1 Parent(s): d779b56

Upload model

Files changed (3)
  1. README.md +15 -113
  2. adapter_config.json +23 -0
  3. adapter_model.safetensors +3 -0
README.md CHANGED
@@ -1,119 +1,21 @@
  ---
  library_name: peft
- license: unknown
- datasets:
- - ehartford/dolphin
- - garage-bAInd/Open-Platypus
- tags:
- - falcon
- inference: false
- pipeline_tag: text-generation
  ---
-
- # falcon-180b-instruct-peft 🦅
-
- This instruction model was built via parameter-efficient QLoRA finetuning of [falcon-180b](https://huggingface.co/tiiuae/falcon-180B) on the first 5k rows of [ehartford/dolphin](https://huggingface.co/datasets/ehartford/dolphin) and the first 5k rows of [garage-bAInd/Open-Platypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus). Finetuning was executed on 4x RTX A6000s (48 GB each) for roughly XX hours on the [Lambda Labs](https://cloud.lambdalabs.com/instances) platform.
-
- ### Benchmark metrics
-
- | Metric | Value |
- |-----------------------|-------|
- | MMLU (5-shot) | Coming |
- | ARC (25-shot) | Coming |
- | HellaSwag (10-shot) | Coming |
- | TruthfulQA (0-shot) | Coming |
- | Avg. | Coming |
-
- We use the state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above, using the same version as Hugging Face's [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
-
- ### Helpful Links
-
- * Model license: [Falcon-180B TII License](https://huggingface.co/spaces/tiiuae/falcon-180b-license/blob/main/LICENSE.txt)
- * Basic usage: coming
- * Finetuning code: coming
- * Loss curves: coming
- * Runtime stats: coming
-
- ### Example prompts and responses
-
- Example 1:
-
- **User**:
- > You are a helpful assistant. Write me a numbered list of things to do in New York City.\n
-
- **falcon-180b-instruct-peft**:
- coming
-
- <br>
-
- Example 2:
-
- **User**:
-
- > You are a helpful assistant. Write a short email inviting my friends to a dinner party on Friday. Respond succinctly.\n
-
- **falcon-180b-instruct-peft**:
-
- coming
-
- <br>
-
- Example 3:
-
- **User**:
-
- > You are a helpful assistant. Tell me a recipe for vegan banana bread.\n
-
- **falcon-180b-instruct-peft**:
-
- coming
-
- <br>
-
-
- ## Finetuning Description
-
- ![loss curves](https://raw.githubusercontent.com/daniel-furman/sft-demos/main/assets/jul_24_23_1_14_00_log_loss_curves_falcon-180b-instruct-peft.png)
-
- The above loss curve was generated from the run's private wandb.ai log.
-
- ## Limitations and Biases
-
- _The following language is modified from [EleutherAI's GPT-NeoX-20B](https://huggingface.co/EleutherAI/gpt-neox-20b)_
-
- This model can produce factually incorrect output and should not be relied on to produce factually accurate information.
- This model was trained on various public datasets.
- While great efforts have been taken to clean the pretraining data, it is possible that this model could generate lewd, biased, or otherwise offensive outputs.
-
- ## How to Use
-
- coming
-
- ### Runtime tests
-
- coming
-
- ## Acknowledgements
-
- This model was finetuned by Daniel Furman on Sep 10, 2023, and is intended for RESEARCH PURPOSES ONLY.
-
- ## Disclaimer
-
- The license on this model does not constitute legal advice. We are not responsible for the actions of third parties who use this model. Please consult an attorney before using this model for commercial purposes.
-
- ## tiiuae/falcon-180B citation
-
- ```
- @article{falcon,
-   title={The Falcon Series of Language Models: Towards Open Frontier Models},
-   author={Almazrouei, Ebtesam and Alobeidli, Hamza and Alshamsi, Abdulaziz and Cappelli, Alessandro and Cojocaru, Ruxandra and Alhammadi, Maitha and Daniele, Mazzotta and Heslow, Daniel and Launay, Julien and Malartic, Quentin and Noune, Badreddine and Pannier, Baptiste and Penedo, Guilherme},
-   year={2023}
- }
- ```
-
- ---
-
+ ## Training procedure
+
+
+ The following `bitsandbytes` quantization config was used during training (see the Python sketch after this diff):
+ - quant_method: bitsandbytes
+ - load_in_8bit: False
+ - load_in_4bit: True
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: nf4
+ - bnb_4bit_use_double_quant: False
+ - bnb_4bit_compute_dtype: bfloat16
  ### Framework versions


- - PEFT 0.5.0.dev0
+ - PEFT 0.6.0.dev0
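For reference, the quantization settings added to the model card above map directly onto `transformers`' `BitsAndBytesConfig`. A minimal sketch, assuming recent `transformers` and `bitsandbytes` releases (fields not set here keep the library defaults listed above):

```python
import torch
from transformers import BitsAndBytesConfig

# Mirrors the config in the new README: 4-bit NF4 quantization,
# no double quantization, bfloat16 compute dtype.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=False,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
```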
adapter_config.json ADDED
@@ -0,0 +1,23 @@
+ {
+   "auto_mapping": null,
+   "base_model_name_or_path": "tiiuae/falcon-180B",
+   "bias": "none",
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "lora_alpha": 16,
+   "lora_dropout": 0.1,
+   "modules_to_save": null,
+   "peft_type": "LORA",
+   "r": 64,
+   "revision": null,
+   "target_modules": [
+     "query_key_value",
+     "dense",
+     "dense_h_to_4h",
+     "dense_4h_to_h"
+   ],
+   "task_type": "CAUSAL_LM"
+ }
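The adapter configuration above corresponds one-to-one with a `peft` `LoraConfig`: rank-64 LoRA over Falcon's attention and MLP projection layers. A minimal sketch (field names follow the `peft` API; this is illustrative, not taken from the training code):

```python
from peft import LoraConfig

# Matches adapter_config.json above.
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.1,
    bias="none",
    target_modules=["query_key_value", "dense", "dense_h_to_4h", "dense_4h_to_h"],
    task_type="CAUSAL_LM",
)
```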
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8f307196c272f1b306245783eb3906c86cfbd7d990c9e9c33048552b06d6e367
+ size 4278283152
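Putting the three files together, the LoRA adapter can be attached to the 4-bit-quantized base model roughly as follows. A hedged sketch: the adapter repo id `dfurman/falcon-180b-instruct-peft` is assumed from this commit's context, and loading `tiiuae/falcon-180B` requires accepting its license as well as very substantial GPU memory:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_id = "tiiuae/falcon-180B"
adapter_id = "dfurman/falcon-180b-instruct-peft"  # assumed from this repo's name

# Quantize the base model as during training (4-bit NF4, bfloat16 compute).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id,
    quantization_config=bnb_config,
    device_map="auto",
)
# Attach the LoRA weights stored in adapter_model.safetensors.
model = PeftModel.from_pretrained(model, adapter_id)
```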