philschmid
/

gemma-7b-dolly-chatml

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

philschmid HF staff commited on Feb 27

Commit

4070fa7

•

1 Parent(s): ddacb92

Update README.md

Files changed (1) hide show

README.md +4 -15

README.md CHANGED Viewed

@@ -18,21 +18,14 @@ should probably proofread and complete it, then remove this comment. -->
 # gemma-7b-dolly-chatml
-This model is a fine-tuned version of [google/gemma-7b](https://huggingface.co/google/gemma-7b) on the generator dataset.
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -46,10 +39,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 3
-### Training results
 ### Framework versions
 - PEFT 0.8.2

 # gemma-7b-dolly-chatml
+This model is a fine-tuned version of [google/gemma-7b](https://huggingface.co/google/gemma-7b) with [philschmid/gemma-tokenizer-chatml](https://huggingface.co/philschmid/gemma-tokenizer-chatml) tokenizer on the [philschmid/dolly-15k-oai-style](https://huggingface.co/datasets/philschmid/dolly-15k-oai-style) using the chatML format.
+The model was fine-tuned with the following [script using Lora (no, qlora)](). I also included a [inference script]() to make sure it works since there were some issues with Gemma. Results of the inference test are
+```bash
+```
 ### Training hyperparameters
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 3
 ### Framework versions
 - PEFT 0.8.2