DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14, 2024 • 1
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published Jun 12, 2024 • 15
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? Paper • 2407.01284 • Published Jul 1, 2024 • 76
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5, 2024 • 32
SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models Paper • 2408.02632 • Published Aug 5, 2024 • 1
SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models Paper • 2408.02632 • Published Aug 5, 2024 • 1