Spaces:

FuseAI
/

README

Running

Wanfq commited on 3 days ago

Commit

be9f18e

verified ·

1 Parent(s): 5f28ccd

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ pinned: false
 <p style="font-size: 40px; font-weight: bold;">Knowledge Fusion of Large Language Models</p>
-<h4> |<a href="https://arxiv.org/abs/2401.10491"> 📑 FuseLLM Paper @ICLR2024 </a> | <a href="https://arxiv.org/abs/2408.07990"> 📑 FuseChat Tech Report </a> | <a href="https://arxiv.org/abs/2412.03187"> 📑 WRPO Tech Report </a> | <a href="https://huggingface.co/FuseAI"> 🤗 HuggingFace Repo </a> | <a href="https://github.com/fanqiwan/FuseLLM"> 🐱 GitHub Repo </a> | <a href="https://huggingface.co/blog/Wanfq/fusechat-3"> 🌐 FuseChat-3.0 Blog </a> | <a href="https://huggingface.co/blog/Wanfq/fuseo1-preview"> 🌐 FuseO1-Preview Blog </a> |
 </h4>
 <p align="center">
@@ -115,10 +115,11 @@ Please cite the following paper if you reference our model, code, data, or paper
 Please cite the following paper if you reference our model, code, data, or paper related to WRPO.
 ```
-@article{yang2024wrpo,
   title={Weighted-Reward Preference Optimization for Implicit Model Fusion},
   author={Ziyi Yang and Fanqi Wan and Longguang Zhong and Tianyuan Shi and Xiaojun Quan},
-  journal={arXiv preprint arXiv:2412.03187},
-  year={2024}
 }
 ```

 <p style="font-size: 40px; font-weight: bold;">Knowledge Fusion of Large Language Models</p>
+<h4> |<a href="https://arxiv.org/abs/2401.10491"> 📑 FuseLLM Paper @ICLR2024 </a> | <a href="https://arxiv.org/abs/2408.07990"> 📑 FuseChat Tech Report </a> | <a href="https://arxiv.org/abs/2412.03187"> 📑 WRPO Paper @ICLR2025 </a> | <a href="https://huggingface.co/FuseAI"> 🤗 HuggingFace Repo </a> | <a href="https://github.com/fanqiwan/FuseLLM"> 🐱 GitHub Repo </a> | <a href="https://huggingface.co/blog/Wanfq/fusechat-3"> 🌐 FuseChat-3.0 Blog </a> | <a href="https://huggingface.co/blog/Wanfq/fuseo1-preview"> 🌐 FuseO1-Preview Blog </a> |
 </h4>
 <p align="center">
 Please cite the following paper if you reference our model, code, data, or paper related to WRPO.
 ```
+@inproceedings{yang2025weightedreward,
   title={Weighted-Reward Preference Optimization for Implicit Model Fusion},
   author={Ziyi Yang and Fanqi Wan and Longguang Zhong and Tianyuan Shi and Xiaojun Quan},
+  booktitle={The Thirteenth International Conference on Learning Representations},
+  year={2025},
+  url={https://openreview.net/forum?id=fq24pEb8SL}
 }
 ```