Open to Collab

7 121 271

Muhammad Umair

umair894

AI & ML interests

Multimodal Reidentification | Feature Upscaling | Object Tracking |PhD UESTC

Recent Activity

upvoted a paper 5 days ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation

upvoted a paper 6 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

liked a Space 7 days ago

facebook/map-anything

View all activity

Organizations

upvoted a paper 5 days ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation

Paper • 2512.21252 • Published 5 days ago • 32

upvoted a paper 6 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 7 days ago • 60

upvoted a paper 7 days ago

Name That Part: 3D Part Segmentation and Naming

Paper • 2512.18003 • Published 10 days ago • 3

upvoted a paper 20 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 20 days ago • 127

upvoted a paper 24 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 25 days ago • 167

upvoted 5 papers about 1 month ago

upvoted 2 papers about 2 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 109

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121

upvoted 8 papers 2 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 177

A Definition of AGI

Paper • 2510.18212 • Published Oct 21 • 34

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22 • 19

DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents

Paper • 2510.19336 • Published Oct 22 • 16

Chronos-2: From Univariate to Universal Forecasting

Paper • 2510.15821 • Published Oct 17 • 19

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

Paper • 2510.14847 • Published Oct 16 • 55

BitNet Distillation

Paper • 2510.13998 • Published Oct 15 • 55

Muhammad Umair

AI & ML interests

Recent Activity

Organizations

umair894's activity