SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching Paper • 2509.24832 • Published Sep 29, 2025
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 7 days ago • 80
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 7 days ago • 24
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper • 2604.24026 • Published 11 days ago • 19
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration Paper • 2605.03042 • Published 4 days ago • 92
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World Paper • 2605.05163 • Published 2 days ago • 31
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 2 days ago • 86
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music Paper • 2605.03395 • Published 3 days ago • 2
MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills Paper • 2604.20441 • Published 16 days ago • 2
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems Paper • 2605.04018 • Published 3 days ago • 28
SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation Paper • 2603.14877 • Published Mar 16 • 2
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization Paper • 2510.16841 • Published Oct 19, 2025
SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis Paper • 2602.07803 • Published Feb 8 • 5