DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation Paper • 2503.06053 • Published Mar 8 • 138
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Paper • 2503.14434 • Published Mar 18 • 7
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Paper • 2503.15478 • Published Mar 19 • 10
Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Paper • 2503.13360 • Published Mar 17 • 6
GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving Paper • 2503.15672 • Published Mar 19 • 3
Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning Paper • 2503.07906 • Published Mar 10 • 4