Zijian Zhou's picture

Zijian Zhou PRO

franciszzj

·

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

upvoted a paper 4 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

upvoted a collection 14 days ago

View all activity

Organizations

None yet

franciszzj's activity

upvoted 2 papers 4 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 8 days ago • 91

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published 7 days ago • 87

upvoted a collection 14 days ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 8 items • Updated Jan 31 • 53

upvoted a paper 24 days ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published 27 days ago • 44

upvoted a paper 28 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 40

upvoted 3 papers about 1 month ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 140

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

upvoted a paper 3 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection 3 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 209

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 364

upvoted a collection 4 months ago

AI Paper of the Day

A collection of papers that I think are interesting, one added each day • 323 items • Updated 2 days ago • 41

upvoted 2 papers 4 months ago

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published Dec 11, 2024 • 46

Learning Flow Fields in Attention for Controllable Person Image Generation

Paper • 2412.08486 • Published Dec 11, 2024 • 37

upvoted a paper 5 months ago

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23

upvoted a paper 6 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 96

upvoted a collection 7 months ago

Playground v2

Collection of Playground v2 models • 4 items • Updated Dec 6, 2023 • 7

upvoted 2 papers 9 months ago

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Paper • 2407.11213 • Published Jul 15, 2024 • 3

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Paper • 2407.16224 • Published Jul 23, 2024 • 28