VitaliiVrublevskyi committed
Commit f94fdae · 1 Parent(s): eefaf92

End of training

README.md CHANGED
@@ -10,6 +10,7 @@ metrics:
  model-index:
  - name: Llama-2-7b-hf-finetuned-mrpc-v5
  results: []
+ library_name: peft
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -37,6 +38,17 @@ More information needed

  ## Training procedure

+
+ The following `bitsandbytes` quantization config was used during training:
+ - load_in_8bit: True
+ - load_in_4bit: False
+ - llm_int8_threshold: 6.0
+ - llm_int8_skip_modules: None
+ - llm_int8_enable_fp32_cpu_offload: False
+ - llm_int8_has_fp16_weight: False
+ - bnb_4bit_quant_type: fp4
+ - bnb_4bit_use_double_quant: False
+ - bnb_4bit_compute_dtype: float32
  ### Training hyperparameters

  The following hyperparameters were used during training:
@@ -62,6 +74,7 @@ The following hyperparameters were used during training:

  ### Framework versions

+ - PEFT 0.4.0
  - Transformers 4.31.0
  - Pytorch 2.0.1+cu118
  - Datasets 2.14.5
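
The `bitsandbytes` settings added to the README in this commit can be read back into keyword arguments. The following is a minimal sketch, not the author's code: the names `CARD_CONFIG`, `parse_card_config`, and `quant_kwargs` are hypothetical, and it uses only the Python standard library. Passing the result to `transformers.BitsAndBytesConfig` is an assumption about how the values would be reused and is left as a comment.

```python
import ast

# Quantization settings copied verbatim from the README lines added in this commit.
CARD_CONFIG = """\
- load_in_8bit: True
- load_in_4bit: False
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: fp4
- bnb_4bit_use_double_quant: False
- bnb_4bit_compute_dtype: float32
"""


def parse_card_config(text):
    """Turn '- key: value' bullet lines into a dict with Python-typed values."""
    config = {}
    for line in text.splitlines():
        line = line.strip().lstrip("-").strip()
        if not line:
            continue
        key, _, raw = line.partition(":")
        raw = raw.strip()
        try:
            # Handles True, False, None, and numbers such as 6.0.
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            # Bare words such as fp4 or float32 are kept as strings.
            value = raw
        config[key.strip()] = value
    return config


quant_kwargs = parse_card_config(CARD_CONFIG)

# Assumed next step (requires transformers, bitsandbytes, and torch; not run here):
#   from transformers import BitsAndBytesConfig
#   bnb_config = BitsAndBytesConfig(**quant_kwargs)
```

Here `load_in_8bit` is `True` and `load_in_4bit` is `False`, so the `bnb_4bit_*` entries are recorded in the model card but would not affect an 8-bit load.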
runs/Sep26_13-04-56_6470e25f5c24/events.out.tfevents.1695743572.6470e25f5c24.1187.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:922a197acee92db51502e1a61f235ed188f62ea6d4b195d750162286e0ce853c
- size 6400
+ oid sha256:8754d878b72311e185acff0713cc22d8aba24c50f851db97bcae9e49fd3a83ec
+ size 6754