Zyphra/Zamba-7B-v1

Text Generation


BerenMillidge committed on Jun 3, 2024

Commit 99fc66b · verified · 1 Parent(s): 3f953fb

Update README.md

Files changed (1)

README.md +3 -2


@@ -7,7 +7,7 @@ Zamba-7B-v1 is a hybrid model between Mamba, a state-space model, and transforme
 Note: the current Huggingface implementation of Zamba performs slower than our internal implementation. We are working to fix this with the Huggingface team.
-Our technical report describing the training of Zamba is available [here](https://arxiv.org/abs/2405.16712)
+Our technical report describing the training of Zamba is available [here](https://arxiv.org/abs/2405.16712).
 ## Quick start
@@ -49,7 +49,8 @@ print(tokenizer.decode(outputs[0]))
 If you find Zamba useful in your work please cite it as:
-```@article{glorioso2024zamba,
+```
+@article{glorioso2024zamba,
   title={Zamba: A Compact 7B SSM Hybrid Model},
   author={Glorioso, Paolo and Anthony, Quentin and Tokpanov, Yury and Whittington, James and Pilault, Jonathan and Ibrahim, Adam and Millidge, Beren},
   journal={arXiv preprint arXiv:2405.16712},