Jianzong Wu PRO

jianzongwu

https://jianzongwu.github.io

AI & ML interests

Multimodal Learning

Organizations

None yet

jianzongwu's activity

authored a paper about 2 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

upvoted a paper about 2 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

updated a dataset about 2 months ago

jianzongwu/MangaZero

Viewer • Updated Dec 11, 2024 • 32.7k • 124 • 22

updated a model about 2 months ago

jianzongwu/DiffSensei

Updated Dec 11, 2024 • 32

upvoted a paper 2 months ago

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 11

liked a Space 2 months ago

Meissonic Flow

🚀

Generate images from text descriptions

updated a dataset 3 months ago

jianzongwu/MotionBooth

Preview • Updated Nov 22, 2024 • 39

upvoted a paper 4 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 50

liked a model 5 months ago

InstantX/InstantID

Text-to-Image • Updated Jan 22, 2024 • 66.8k • 766

liked a Space 6 months ago

150

UniPortrait

👩

Generate images using text and ID photos

liked a Space 7 months ago

Magi Demo

🏢

Generate transcript from comic image

authored a paper 7 months ago

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 11

updated a model 7 months ago

jianzongwu/MotionBooth

Updated Jun 28, 2024

upvoted 2 papers 7 months ago

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 53

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Paper • 2406.17758 • Published Jun 25, 2024 • 19

authored a paper 7 months ago

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Paper • 2406.17758 • Published Jun 25, 2024 • 19

liked a model 8 months ago

liuhaotian/llava-v1.6-34b

Image-Text-to-Text • Updated May 9, 2024 • 13.2k • 343

updated a model 8 months ago

jianzongwu/lgvi

Text-to-Image • Updated Jun 4, 2024 • 73

liked a model 10 months ago

Ligeng-Zhu/panda70m-download

Updated Mar 2, 2024 • 13

updated a model 11 months ago

jianzongwu/lgvi-i

Text-to-Image • Updated Mar 7, 2024 • 9