MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published 4 days ago • 35
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 4 days ago • 99
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 7 days ago • 141
Efficient Model Selection for Time Series Forecasting via LLMs Paper • 2504.02119 • Published 13 days ago • 16
FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published 13 days ago • 17
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations Paper • 2503.23162 • Published 17 days ago • 11
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 15 days ago • 239
RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation Paper • 2503.17735 • Published 24 days ago • 3
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published 26 days ago • 43
Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 25 days ago • 35
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper • 2503.16365 • Published 26 days ago • 38
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 28 days ago • 137
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills Paper • 2503.12533 • Published about 1 month ago • 63
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models Paper • 2503.11224 • Published Mar 14 • 26
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published Mar 14 • 132