Le Zhuo

JackyZhuo

AI & ML interests

None yet

Recent Activity

updated a model about 10 hours ago

diffusion-cot/experimental-valdata

published a model about 10 hours ago

diffusion-cot/experimental-valdata

JackyZhuo's activity

upvoted a collection about 1 month ago

Open Image Preferences

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9

upvoted a paper about 2 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 15

upvoted a paper 2 months ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 69

upvoted a paper 3 months ago

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

Paper • 2412.09428 • Published Dec 12, 2024 • 7

upvoted a paper 4 months ago

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Paper • 2411.14794 • Published Nov 22, 2024 • 13

upvoted a paper 6 months ago

PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions

Paper • 2409.15278 • Published Sep 23, 2024 • 25

upvoted 2 papers 7 months ago

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Paper • 2408.15881 • Published Aug 28, 2024 • 21

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Paper • 2408.02657 • Published Aug 5, 2024 • 34

upvoted 4 papers over 1 year ago

3D-GPT: Procedural 3D Modeling with Large Language Models

Paper • 2310.12945 • Published Oct 19, 2023 • 59

Brain2Music: Reconstructing Music from Human Brain Activity

Paper • 2307.11078 • Published Jul 20, 2023 • 41

HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution

Paper • 2306.15794 • Published Jun 27, 2023 • 17

Language-Guided Music Recommendation for Video via Prompt Analogies

Paper • 2306.09327 • Published Jun 15, 2023 • 8