onebitquantized commited on
Commit
8311d35
1 Parent(s): fcc743c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -65,6 +65,20 @@ outputs = model.generate(**inputs, do_sample=True, max_new_tokens=1024)
65
  print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
66
  ```
67
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
68
  # Contact Us
69
 
70
  For additional xMADified models, access to fine-tuning, and general questions, please contact us at [email protected] and join our waiting list.
 
65
  print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
66
  ```
67
 
68
+ # Citation
69
+
70
+ If you found this model useful, please cite our research paper.
71
+
72
+ ```
73
+ @article{zhang2024leanquant,
74
+ title={LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid},
75
+ author={Zhang, Tianyi and Shrivastava, Anshumali},
76
+ journal={arXiv preprint arXiv:2407.10032},
77
+ year={2024},
78
+ url={https://arxiv.org/abs/2407.10032},
79
+ }
80
+ ```
81
+
82
  # Contact Us
83
 
84
  For additional xMADified models, access to fine-tuning, and general questions, please contact us at [email protected] and join our waiting list.