Yu_xm's picture

Open to Collab

Yu_xm

Yu2020

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

authored a paper 14 days ago

Anisotropic Modality Align

upvoted a paper 14 days ago

Anisotropic Modality Align

View all activity

Organizations

upvoted a paper 5 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published 7 days ago • 48

upvoted a paper 14 days ago

Anisotropic Modality Align

Paper • 2605.07825 • Published 17 days ago • 27

upvoted a paper about 1 month ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

upvoted 4 papers 3 months ago

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Paper • 2602.10063 • Published Feb 10 • 75

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76

Sparse Reward Subsystem in Large Language Models

Paper • 2602.00986 • Published Feb 1 • 13

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

upvoted 5 papers 4 months ago

Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning

Paper • 2601.07641 • Published Jan 12 • 48

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published Jan 14 • 42

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

Paper • 2601.04745 • Published Jan 8 • 59

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published Jan 11 • 81

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 214

upvoted a paper 5 months ago

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Paper • 2512.13495 • Published Dec 15, 2025 • 11

upvoted 2 papers 6 months ago

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 92

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published Nov 22, 2025 • 38

upvoted 3 papers about 1 year ago

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Paper • 2505.12448 • Published May 18, 2025 • 10

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28, 2025 • 38

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

Paper • 2502.17258 • Published Feb 24, 2025 • 79