Papers - a aoiandroid Collection

aoiandroid 's Collections

Papers

updated 14 minutes ago

SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching

Paper • 2509.24832 • Published Sep 29, 2025
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 7 days ago • 80
Map2World: Segment Map Conditioned Text to 3D World Generation

Paper • 2605.00781 • Published 7 days ago • 24
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

Paper • 2604.24026 • Published 11 days ago • 19
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 4 days ago • 92
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World

Paper • 2605.05163 • Published 2 days ago • 31
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 2 days ago • 86
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music

Paper • 2605.03395 • Published 3 days ago • 2
MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

Paper • 2604.20441 • Published 16 days ago • 2
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Paper • 2605.04018 • Published 3 days ago • 28
SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation

Paper • 2603.14877 • Published Mar 16 • 2
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization

Paper • 2510.16841 • Published Oct 19, 2025
SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Paper • 2602.07803 • Published Feb 8 • 5