piuzha committed
Commit 5f40679 · verified · 1 Parent(s): 1604958

Update README.md

Files changed (1): README.md (+10 -2)
README.md CHANGED

@@ -65,7 +65,7 @@ print(sequences[0]['generated_text'])
 
 ## Evaluation
 
-We test the performance of our model with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness). The evaluation results on common datasets are shown below. We test on AI2 Reasoning Challenge (25-shot), HellaSwag (10-shot), MMLU (5-shot), and Winogrande (5-shot).
+We test the performance of our model with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness). The evaluation results on common datasets are shown below. We test on AI2 Reasoning Challenge (25-shot), HellaSwag (10-shot), MMLU (5-shot), and Winogrande (5-shot). We release the Moxin-7B-finetuned as our base model. We further finetune our base model on Tulu v2 to obtain our chat model.
 
 | Models                 | ARC-C | Hellaswag | MMLU  | WinoGrade | Ave   |
 |:----------------------:|:-----:|:---------:|:-----:|:---------:|:-----:|
@@ -100,7 +100,15 @@ We also test the zero shot performance on AI2 Reasoning Challenge (0-shot), AI2
 | Moxin-7B-finetune | 80.03 | 75.17 | 82.24 | 81.12 | 58.64 | 75.44 |
 
 
-
+## Citation
+```
+@article{zhao2024fully,
+  title={Fully Open Source Moxin-7B Technical Report},
+  author={Zhao, Pu and Shen, Xuan and Kong, Zhenglun and Shen, Yixin and Chang, Sung-En and Rupprecht, Timothy and Lu, Lei and Nan, Enfu and Yang, Changdi and He, Yumei and others},
+  journal={arXiv preprint arXiv:2412.06845},
+  year={2024}
+}
+```
 
 
 
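The paragraph added in the diff above attributes all reported scores to lm-evaluation-harness. For readers who want to rerun those benchmarks, a minimal sketch against the harness's Python API (v0.4+) is below; `MODEL_PATH`, the `hf` backend choice, and the batch size are illustrative assumptions, not values taken from this commit.

```python
# A minimal reproduction sketch, assuming lm-evaluation-harness v0.4+ is
# installed (pip install lm-eval). MODEL_PATH is a placeholder, not an
# official checkpoint name from this repo.
import lm_eval

MODEL_PATH = "path/to/Moxin-7B"  # placeholder: local dir or Hugging Face hub ID

# The README uses a different shot count per benchmark, so each task gets
# its own run with its own num_fewshot value.
SHOTS = {"arc_challenge": 25, "hellaswag": 10, "mmlu": 5, "winogrande": 5}

for task, n_shot in SHOTS.items():
    out = lm_eval.simple_evaluate(
        model="hf",                            # Hugging Face transformers backend
        model_args=f"pretrained={MODEL_PATH}",
        tasks=[task],
        num_fewshot=n_shot,
        batch_size=8,                          # assumption; tune to your GPU
    )
    # out["results"] maps task names (and, for groups like mmlu, subtask
    # names) to their metric dicts.
    print(task, out["results"])
```

The same runs can be launched from the command line with the `lm_eval` entry point and `--num_fewshot`, one invocation per shot count.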