assafbk commited on
Commit
8087b82
1 Parent(s): 17bbc21

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -3
README.md CHANGED
@@ -1,3 +1,27 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ inference: false
3
+ license: mit
4
+ tags:
5
+ - text-generation
6
+ - mamba
7
+ - long context
8
+ ---
9
+
10
+ # DeciMamba Checkpoint (Baseline)
11
+
12
+ The official checkpoint of Mamba-130m, finetuned for Language Modeling over the PG-19 dataset as presented in [DeciMamba: Exploring the Length Extrapolation Potential of Mamba](https://arxiv.org/abs/2406.14528).
13
+
14
+ See our [Github Repo](https://github.com/assafbk/DeciMamba) for evalution and training scripts.
15
+
16
+ Bibtex:
17
+ ```
18
+ @misc{benkish2024decimambaexploringlengthextrapolation,
19
+ title={DeciMamba: Exploring the Length Extrapolation Potential of Mamba},
20
+ author={Assaf Ben-Kish and Itamar Zimerman and Shady Abu-Hussein and Nadav Cohen and Amir Globerson and Lior Wolf and Raja Giryes},
21
+ year={2024},
22
+ eprint={2406.14528},
23
+ archivePrefix={arXiv},
24
+ primaryClass={cs.LG},
25
+ url={https://arxiv.org/abs/2406.14528},
26
+ }
27
+ ```