RichardErkhov commited on
Commit
c4316e2
·
verified ·
1 Parent(s): 39e03e2

uploaded readme

Browse files
Files changed (1) hide show
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ phi-2-alpaca-cleaned - bnb 4bits
11
+ - Model creator: https://huggingface.co/ohashi56225/
12
+ - Original model: https://huggingface.co/ohashi56225/phi-2-alpaca-cleaned/
13
+
14
+
15
+
16
+
17
+ Original model description:
18
+ ---
19
+ license: mit
20
+ datasets:
21
+ - yahma/alpaca-cleaned
22
+ language:
23
+ - en
24
+ library_name: transformers
25
+ pipeline_tag: text-generation
26
+ ---
27
+
28
+ # phi-2-alpaca-cleaned
29
+ This model is an instruction-tuned version of the [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) model fine-tuned on the [yahma/alpaca-cleaned](https://huggingface.co/datasets/yahma/alpaca-cleaned) dataset.
30
+
31
+ In the training, full parameter fine-tuning of phi-2 was performed, and LoRA was not used.
32
+
33
+ ## Text Format
34
+ ```
35
+ Below is an instruction that describes a task. Write a response that appropriately completes the request.
36
+ ### Instruction:
37
+ Based on the information provided, rewrite the sentence by changing its tense from past to future.
38
+
39
+ ### Input:
40
+ She played the piano beautifully for hours and then stopped as it was midnight.
41
+
42
+ ### Response:
43
+ She will play the piano beautifully for hours and then stop as it will be midnight.
44
+ ```
45
+
46
+ ## Training
47
+ - GPUs: 8 × A6000 48GB
48
+ - per_device_train_batch_size: 8
49
+ - gradient_accumulation_steps: 8
50
+ - per_device_eval_batch_size: 8
51
+ - num_train_epochs: 3
52
+ - learning_rate: 2e-5
53
+ - warmup_ratio: 0.03
54
+
55
+ ## Software
56
+ - pytorch: 2.1.2
57
+ - transformers: 4.38.0.dev0
58
+ - accelerate: 0.26.1
59
+ - deepspeed: 0.13.1
60
+
61
+