Minho Park

mpark

https://pmh9960.github.io/

AI & ML interests

Computer Vision, Robotics

Recent Activity

updated a dataset 11 days ago

DAVIAN-Robotics/RoboLab-BananaInBowl-30cm-50k-hold8

published a dataset 11 days ago

DAVIAN-Robotics/RoboLab-BananaInBowl-30cm-50k-hold8

updated a dataset 12 days ago

DAVIAN-Robotics/RoboLab-BananaInBowl-30cm-50k-hold15

Organizations

upvoted a paper 3 months ago

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

Paper • 2603.25750 • Published Mar 20 • 36

upvoted a paper 4 months ago

Residual Off-Policy RL for Finetuning Behavior Cloning Policies

Paper • 2509.19301 • Published Sep 23, 2025 • 20

upvoted a paper 6 months ago

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published Dec 19, 2025 • 99

upvoted 3 papers 7 months ago

Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation

Paper • 2512.17040 • Published Dec 18, 2025 • 29

Vector Prism: Animating Vector Graphics by Stratifying Semantic Structure

Paper • 2512.14336 • Published Dec 16, 2025 • 32

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published Dec 9, 2025 • 124

upvoted 3 papers 8 months ago

upvoted a paper 9 months ago

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18, 2025 • 49

upvoted an article 11 months ago

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Article • nvidia • Jun 11, 2025 • 134

upvoted a paper 12 months ago

DesignLab: Designing Slides Through Iterative Detection and Correction

Paper • 2507.17202 • Published Jul 23, 2025 • 51

upvoted 2 papers about 1 year ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 69

SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Paper • 2504.14396 • Published Apr 19, 2025 • 27

upvoted 4 papers over 1 year ago

DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes

Paper • 2412.11100 • Published Dec 15, 2024 • 7

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2, 2024 • 55

Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

Paper • 2411.18664 • Published Nov 27, 2024 • 24

SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning

Paper • 2410.09754 • Published Oct 13, 2024 • 9

upvoted a paper about 2 years ago

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Paper • 2404.02905 • Published Apr 3, 2024 • 74

upvoted a paper over 2 years ago

Diffusion Model Alignment Using Direct Preference Optimization

Paper • 2311.12908 • Published Nov 21, 2023 • 49
