OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Paper • 2501.08197 • Published 20 days ago • 7
Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance Paper • 2406.15330 • Published Jun 21, 2024
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training Paper • 2411.14318 • Published Nov 21, 2024
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 26 days ago • 14
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published Dec 24, 2024 • 37
Patience Is The Key to Large Language Model Reasoning Paper • 2411.13082 • Published Nov 20, 2024 • 7
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks Paper • 2410.04422 • Published Oct 6, 2024 • 7
"Paraphrasing The Original Text" Makes High Accuracy Long-Context QA Paper • 2312.11193 • Published Dec 18, 2023
An Intelligent Remote Sensing Image Quality Inspection System Paper • 2307.11965 • Published Jul 22, 2023
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension Paper • 2406.02536 • Published Jun 4, 2024
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Paper • 2408.16767 • Published Aug 29, 2024 • 30
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Paper • 2405.20343 • Published May 30, 2024 • 3
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion Paper • 2406.04338 • Published Jun 6, 2024 • 35
SEABO: A Simple Search-Based Method for Offline Imitation Learning Paper • 2402.03807 • Published Feb 6, 2024
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence Paper • 2404.05892 • Published Apr 8, 2024 • 33
DreamReward: Text-to-3D Generation with Human Preference Paper • 2403.14613 • Published Mar 21, 2024 • 36
Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation Paper • 2403.09625 • Published Mar 14, 2024 • 1
Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention Paper • 2303.13014 • Published Mar 23, 2023 • 1