CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model Paper • 2310.06266 • Published Oct 10, 2023 • 1
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning Paper • 2311.02303 • Published Nov 4, 2023 • 9
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models Paper • 2410.06741 • Published Oct 9, 2024 • 1
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM Paper • 2503.17793 • Published 22 days ago • 2
Ring Collection Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling. • 1 item • Updated 12 days ago
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study Paper • 2404.10719 • Published Apr 16, 2024 • 5