Wanfq committed on
Commit be9f18e · verified · 1 Parent(s): 5f28ccd

Update README.md

Files changed (1)
  1. README.md +5 -4
README.md CHANGED
@@ -15,7 +15,7 @@ pinned: false
  <p style="font-size: 40px; font-weight: bold;">Knowledge Fusion of Large Language Models</p>
 
 
- <h4> |<a href="https://arxiv.org/abs/2401.10491"> 📑 FuseLLM Paper @ICLR2024 </a> | <a href="https://arxiv.org/abs/2408.07990"> 📑 FuseChat Tech Report </a> | <a href="https://arxiv.org/abs/2412.03187"> 📑 WRPO Tech Report </a> | <a href="https://huggingface.co/FuseAI"> 🤗 HuggingFace Repo </a> | <a href="https://github.com/fanqiwan/FuseLLM"> 🐱 GitHub Repo </a> | <a href="https://huggingface.co/blog/Wanfq/fusechat-3"> 🌐 FuseChat-3.0 Blog </a> | <a href="https://huggingface.co/blog/Wanfq/fuseo1-preview"> 🌐 FuseO1-Preview Blog </a> |
+ <h4> |<a href="https://arxiv.org/abs/2401.10491"> 📑 FuseLLM Paper @ICLR2024 </a> | <a href="https://arxiv.org/abs/2408.07990"> 📑 FuseChat Tech Report </a> | <a href="https://arxiv.org/abs/2412.03187"> 📑 WRPO Paper @ICLR2025 </a> | <a href="https://huggingface.co/FuseAI"> 🤗 HuggingFace Repo </a> | <a href="https://github.com/fanqiwan/FuseLLM"> 🐱 GitHub Repo </a> | <a href="https://huggingface.co/blog/Wanfq/fusechat-3"> 🌐 FuseChat-3.0 Blog </a> | <a href="https://huggingface.co/blog/Wanfq/fuseo1-preview"> 🌐 FuseO1-Preview Blog </a> |
  </h4>
 
  <p align="center">
@@ -115,10 +115,11 @@ Please cite the following paper if you reference our model, code, data, or paper
 
  Please cite the following paper if you reference our model, code, data, or paper related to WRPO.
  ```
- @article{yang2024wrpo,
+ @inproceedings{yang2025weightedreward,
  title={Weighted-Reward Preference Optimization for Implicit Model Fusion},
  author={Ziyi Yang and Fanqi Wan and Longguang Zhong and Tianyuan Shi and Xiaojun Quan},
- journal={arXiv preprint arXiv:2412.03187},
- year={2024}
+ booktitle={The Thirteenth International Conference on Learning Representations},
+ year={2025},
+ url={https://openreview.net/forum?id=fq24pEb8SL}
  }
  ```