hazyresearch/mamba-1b-50b

simarora committed on Apr 20, 2024

Commit fe69dcb (1 parent: eb13565)

Update README.md

Files changed (1)

README.md +4 -0

@@ -9,6 +9,8 @@ language:
 This model is pretrained as a reference baseline to the Based model provided here: https://huggingface.co/hazyresearch/based-1b-50b
 Both checkpoints are pretrained on 50Bn tokens of the Pile in the exact same data order using next token prediction.
+A WandB report for training is here: https://api.wandb.ai/links/hazy-research/ggo9rst2
 ### Model Sources
@@ -43,3 +45,5 @@ Please consider citing this paper if you use our work:
   year={2024}
 }
 ```
+Please reach out to [email protected], [email protected], and [email protected] with questions.