RichardErkhov
/

ohashi56225_-_phi-2-alpaca-cleaned-4bits

4-bit precision

Model card Files Files and versions Community

RichardErkhov commited on 16 days ago

Commit

c4316e2

·

verified ·

1 Parent(s): 39e03e2

uploaded readme

Files changed (1) hide show

README.md +61 -0

README.md ADDED Viewed

	@@ -0,0 +1,61 @@

+Quantization made by Richard Erkhov.
+[Github](https://github.com/RichardErkhov)
+[Discord](https://discord.gg/pvy7H8DZMG)
+[Request more models](https://github.com/RichardErkhov/quant_request)
+phi-2-alpaca-cleaned - bnb 4bits
+- Model creator: https://huggingface.co/ohashi56225/
+- Original model: https://huggingface.co/ohashi56225/phi-2-alpaca-cleaned/
+Original model description:
+---
+license: mit
+datasets:
+- yahma/alpaca-cleaned
+language:
+- en
+library_name: transformers
+pipeline_tag: text-generation
+---
+# phi-2-alpaca-cleaned
+This model is an instruction-tuned version of the [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) model fine-tuned on the [yahma/alpaca-cleaned](https://huggingface.co/datasets/yahma/alpaca-cleaned) dataset.
+In the training, full parameter fine-tuning of phi-2 was performed, and LoRA was not used.
+## Text Format
+```
+Below is an instruction that describes a task. Write a response that appropriately completes the request.
+### Instruction:
+Based on the information provided, rewrite the sentence by changing its tense from past to future.
+### Input:
+She played the piano beautifully for hours and then stopped as it was midnight.
+### Response:
+She will play the piano beautifully for hours and then stop as it will be midnight.
+```
+## Training
+- GPUs: 8 × A6000 48GB
+- per_device_train_batch_size: 8
+- gradient_accumulation_steps: 8
+- per_device_eval_batch_size: 8
+- num_train_epochs: 3
+- learning_rate: 2e-5
+- warmup_ratio: 0.03
+## Software
+- pytorch: 2.1.2
+- transformers: 4.38.0.dev0
+- accelerate: 0.26.1
+- deepspeed: 0.13.1