3 4

Aliaksandr Siarohin

aliaksandr-siarohin

AI & ML interests

None yet

Recent Activity

commented a paper 6 days ago

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

upvoted a paper 6 days ago

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

upvoted a paper 13 days ago

Mind the Time: Temporally-Controlled Multi-Event Video Generation

View all activity

Organizations

None yet

aliaksandr-siarohin's activity

commented a paper 6 days ago

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Paper • 2412.15191 • Published 7 days ago • 5 •

upvoted a paper 6 days ago

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation

Paper • 2412.15191 • Published 7 days ago • 5

upvoted a paper 13 days ago

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Paper • 2412.05263 • Published 20 days ago • 10

authored a paper 15 days ago

Video Motion Transfer with Diffusion Transformers

Paper • 2412.07776 • Published 16 days ago • 17

authored a paper 17 days ago

Mind the Time: Temporally-Controlled Multi-Event Video Generation

Paper • 2412.05263 • Published 20 days ago • 10

upvoted a paper 20 days ago

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion

Paper • 2412.04462 • Published 21 days ago • 7

authored a paper 24 days ago

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Paper • 2411.18673 • Published 29 days ago • 8

authored a paper 6 months ago

VIMI: Grounding Video Generation through Multi-modal Instruction

Paper • 2407.06304 • Published Jul 8 • 9

authored 4 papers 7 months ago

Hierarchical Patch Diffusion Models for High-Resolution Video Generation

Paper • 2406.07792 • Published Jun 12 • 13

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Paper • 2406.07472 • Published Jun 11 • 11

GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

Paper • 2406.05649 • Published Jun 9 • 8

SF-V: Single Forward Video Generation Model

Paper • 2406.04324 • Published Jun 6 • 23

authored 2 papers 10 months ago

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Paper • 2402.19479 • Published Feb 29 • 32

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Paper • 2402.14797 • Published Feb 22 • 19

authored a paper 11 months ago

AToM: Amortized Text-to-Mesh using 2D Diffusion

Paper • 2402.00867 • Published Feb 1 • 10

authored a paper 12 months ago

Diffusion Priors for Dynamic View Synthesis from Monocular Videos

Paper • 2401.05583 • Published Jan 10 • 8

authored a paper about 1 year ago

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion

Paper • 2310.08579 • Published Oct 12, 2023 • 15

upvoted a paper over 1 year ago

AutoDecoding Latent 3D Diffusion Models

Paper • 2307.05445 • Published Jul 7, 2023 • 13