
Hyoung-Kyu Song

deepkyu

https://linktr.ee/deepkyu

AI & ML interests

Efficient model for image/video generation

Recent Activity

upvoted a paper about 2 months ago

Latent Diffusion Model without Variational Autoencoder

upvoted a paper about 2 months ago

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

liked a dataset about 2 months ago

QingyanBai/Ditto-1M



upvoted 3 papers about 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 48

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7 • 141

upvoted a paper 3 months ago

Lynx: Towards High-Fidelity Personalized Video Generation

Paper • 2509.15496 • Published Sep 19 • 12

upvoted a paper 5 months ago

JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching

Paper • 2506.23552 • Published Jun 30 • 11

upvoted a paper 6 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9 • 27

upvoted a paper 8 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1 • 15

upvoted a paper 9 months ago

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published Mar 12 • 41

upvoted a collection 9 months ago

SANA-Sprint

Collection

🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated Sep 13 • 43

upvoted 5 papers 12 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41

upvoted 6 papers about 1 year ago

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Paper • 2411.10499 • Published Nov 15, 2024 • 13

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published Oct 25, 2024 • 23

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 130

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 87

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23
