Patrick Kwon's picture

16 6

Patrick Kwon

yj7082126

·

yj7082126

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Target-Aware Video Diffusion Models

upvoted a paper about 2 months ago

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

upvoted a paper 3 months ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

View all activity

Organizations

None yet

yj7082126's activity

upvoted a paper 1 day ago

Target-Aware Video Diffusion Models

Paper • 2503.18950 • Published 11 days ago • 2

upvoted a paper about 2 months ago

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Paper • 2501.18804 • Published Jan 30 • 5

upvoted 2 papers 3 months ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published Jan 16 • 36

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 77

upvoted a paper 4 months ago

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 28

upvoted 2 papers 7 months ago

LVCD: Reference-based Lineart Video Colorization with Diffusion Models

Paper • 2409.12960 • Published Sep 19, 2024 • 24

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 29

upvoted 3 papers 8 months ago

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 32

VidGen-1M: A Large-Scale Dataset for Text-to-video Generation

Paper • 2408.02629 • Published Aug 5, 2024 • 15

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28, 2024 • 27

upvoted 6 papers about 1 year ago

3D Congealing: 3D-Aware Image Alignment in the Wild

Paper • 2404.02125 • Published Apr 2, 2024 • 10

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 23

Pix2Gif: Motion-Guided Diffusion for GIF Generation

Paper • 2403.04634 • Published Mar 7, 2024 • 17

Anything in Any Scene: Photorealistic Video Object Insertion

Paper • 2401.17509 • Published Jan 30, 2024 • 17

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Paper • 2401.09340 • Published Jan 17, 2024 • 21

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

Paper • 2401.09985 • Published Jan 18, 2024 • 17