wandb
/

gemma-7b-zephyr-sft

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tcapelle commited on Feb 28

Commit

3e8fdc6

•

1 Parent(s): 97201c0

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 library_name: transformers
 datasets:
 - HuggingFaceH4/ultrachat_200k
-finetuned_from: google/gemma-7b
 ---
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/llm_surgery/gemma-zephyr)
@@ -11,6 +11,12 @@ finetuned_from: google/gemma-7b
 The [Zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) SFT recipe applied on top of Gemma 7B
 ## Recipe
 We trained using the [alignment handbook recipe](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_sft.py) and logging to W&B

 library_name: transformers
 datasets:
 - HuggingFaceH4/ultrachat_200k
+base_model: google/gemma-7b
 ---
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/llm_surgery/gemma-zephyr)
 The [Zephyr](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) SFT recipe applied on top of Gemma 7B
+## Model description
+- **Model type:** A 8.5B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
+- **Language(s) (NLP):** Primarily English
+- **Finetuned from model:** [google/gemma-7b](https://huggingface.co/google/gemma-7b)
 ## Recipe
 We trained using the [alignment handbook recipe](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_sft.py) and logging to W&B