VitaliiVrublevskyi
commited on
Commit
·
e41147f
1
Parent(s):
1baf98e
End of training
Browse files
README.md
CHANGED
@@ -10,6 +10,7 @@ metrics:
|
|
10 |
model-index:
|
11 |
- name: Llama-2-7b-hf-finetuned-mrpc
|
12 |
results: []
|
|
|
13 |
---
|
14 |
|
15 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -37,6 +38,17 @@ More information needed
|
|
37 |
|
38 |
## Training procedure
|
39 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
40 |
### Training hyperparameters
|
41 |
|
42 |
The following hyperparameters were used during training:
|
@@ -76,6 +88,7 @@ The following hyperparameters were used during training:
|
|
76 |
|
77 |
### Framework versions
|
78 |
|
|
|
79 |
- Transformers 4.31.0
|
80 |
- Pytorch 2.0.1+cu118
|
81 |
- Datasets 2.14.5
|
|
|
10 |
model-index:
|
11 |
- name: Llama-2-7b-hf-finetuned-mrpc
|
12 |
results: []
|
13 |
+
library_name: peft
|
14 |
---
|
15 |
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
38 |
|
39 |
## Training procedure
|
40 |
|
41 |
+
|
42 |
+
The following `bitsandbytes` quantization config was used during training:
|
43 |
+
- load_in_8bit: True
|
44 |
+
- load_in_4bit: False
|
45 |
+
- llm_int8_threshold: 6.0
|
46 |
+
- llm_int8_skip_modules: None
|
47 |
+
- llm_int8_enable_fp32_cpu_offload: False
|
48 |
+
- llm_int8_has_fp16_weight: False
|
49 |
+
- bnb_4bit_quant_type: fp4
|
50 |
+
- bnb_4bit_use_double_quant: False
|
51 |
+
- bnb_4bit_compute_dtype: float32
|
52 |
### Training hyperparameters
|
53 |
|
54 |
The following hyperparameters were used during training:
|
|
|
88 |
|
89 |
### Framework versions
|
90 |
|
91 |
+
- PEFT 0.4.0
|
92 |
- Transformers 4.31.0
|
93 |
- Pytorch 2.0.1+cu118
|
94 |
- Datasets 2.14.5
|
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 345151
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9a093a08fdd909aad620a73d18c6362c4cdf19d4249572828c3370d02821935d
|
3 |
size 345151
|
runs/Sep25_21-52-54_588614263c59/events.out.tfevents.1695682444.588614263c59.1795.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:17f98886165e4a08b41e22257f6e1316cf3730f1ad75f3c0f23813bf91ceeea0
|
3 |
+
size 8920
|
runs/Sep25_22-58-42_588614263c59/events.out.tfevents.1695682727.588614263c59.24754.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e6c9593647e001c7171b241fc45d6402fd08d6c3f73ac8eca9cb7ff7d4469444
|
3 |
+
size 4808
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4027
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c625e877c4a2f1ddb063621270f02847f7379c088d69ab1d86d2cd6c8dc48930
|
3 |
size 4027
|