Takashi Shibuya's picture

1

Takashi Shibuya

TakashiShibuyaSony

TakashiShibuyaSony

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation

authored a paper 2 days ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

upvoted a paper 2 days ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

View all activity

Organizations

None yet

TakashiShibuyaSony's activity

authored 2 papers 2 days ago

A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation

Paper • 2409.17550 • Published Sep 26

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published 6 days ago • 15

upvoted a paper 2 days ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published 6 days ago • 15

authored 2 papers 4 months ago

BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network

Paper • 2309.02836 • Published Sep 6, 2023

GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping

Paper • 2405.17251 • Published May 27 • 2

authored 3 papers 7 months ago

SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer

Paper • 2301.12811 • Published Jan 30, 2023

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

Paper • 2405.14598 • Published May 23 • 11

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

Paper • 2405.18503 • Published May 28 • 9