FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 13 items • Updated 19 days ago • 12
view article Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • Jan 20 • 17
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12