1 9 33

Yanhong Zeng

zengyh1900

https://zengyh1900.github.io/

AI & ML interests

Generative AI for Content Creation.

Recent Activity

authored a paper 25 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

liked a model 3 months ago

internlm/internlm3-8b-instruct-gptq-int4

liked a model 3 months ago

internlm/internlm3-8b-instruct-awq

View all activity

Organizations

None yet

zengyh1900's activity

authored a paper 25 days ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published 26 days ago • 34

liked 5 models 3 months ago

upvoted a paper 4 months ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 95

liked a model 4 months ago

jianzongwu/DiffSensei

Updated Dec 11, 2024 • 37

liked a dataset 4 months ago

jianzongwu/MangaZero

Viewer • Updated Dec 11, 2024 • 32.7k • 146 • 28

authored a paper 4 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 49

upvoted 2 papers 4 months ago

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 12

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 49

upvoted a paper 9 months ago

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Paper • 2407.17438 • Published Jul 24, 2024 • 25

authored a paper 9 months ago

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Paper • 2407.17438 • Published Jul 24, 2024 • 25

liked a model 9 months ago

Leoxing/Live2Diff

Text-to-Video • Updated Jul 17, 2024 • 11

liked a Space 9 months ago

Live2Diff

🦀

authored a paper 9 months ago

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Paper • 2407.08701 • Published Jul 11, 2024 • 12

upvoted a paper 9 months ago

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Paper • 2407.08701 • Published Jul 11, 2024 • 12

liked a Space 10 months ago

741

Omost

😻

Generate images from text prompts using AI

liked a model 10 months ago

Gaojunyao/StyleShot

Text-to-Image • Updated Sep 9, 2024 • 14