Audio-Visual

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

wchai authored a paper 6 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

wchai authored a paper 4 months ago

PAD: Personalized Alignment at Decoding-Time

wchai authored a paper 4 months ago

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

View all activity

AV-dataset's activity

wchai

authored a paper 6 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 7 days ago • 25

wchai

authored 2 papers 4 months ago

PAD: Personalized Alignment at Decoding-Time

Paper • 2410.04070 • Published Oct 5, 2024

SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory

Paper • 2411.11922 • Published Nov 18, 2024 • 19

DwanZhang

authored 4 papers 4 months ago

wchai

authored a paper 5 months ago

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Paper • 2410.03051 • Published Oct 4, 2024 • 6

wchai

authored 2 papers 6 months ago

Chasing Consistency in Text-to-3D Generation from a Single Image

Paper • 2309.03599 • Published Sep 7, 2023 • 1

RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark

Paper • 2407.13930 • Published Jul 18, 2024

DwanZhang

authored a paper about 1 year ago

Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering

Paper • 2402.00827 • Published Feb 1, 2024 • 2

wchai

authored 3 papers over 1 year ago

See and Think: Embodied Agent in Virtual Environment

Paper • 2311.15209 • Published Nov 26, 2023 • 2

StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Paper • 2308.09592 • Published Aug 18, 2023 • 2

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Paper • 2307.16449 • Published Jul 31, 2023 • 16

AI & ML interests

Recent Activity

Team members 4

AV-dataset's activity