larry

szh

AI & ML interests

None yet

szh's activity

upvoted a collection 20 days ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119

upvoted a collection 23 days ago

Sana

Collection

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 5 days ago • 58

liked a model about 1 month ago

trojblue/sdxl-finetune-pen-feel

Updated Oct 28, 2023 • 1

upvoted a paper 2 months ago

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Paper • 2410.10812 • Published Oct 14 • 15

liked a Space 2 months ago

Running

👀

Text To Anime Arena

upvoted 2 papers 3 months ago

Progressive Autoregressive Video Diffusion Models

Paper • 2410.08151 • Published Oct 10 • 15

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

Paper • 2409.18964 • Published Sep 27 • 25

New activity in playgroundai/CapsBench 3 months ago

There are 136 rows where ‘image’ is None in data. Please correct this.

#2 opened 3 months ago by

szh

updated a model 3 months ago

incantor/image_complexity_ic9600

Updated Sep 14 • 1

liked a model 4 months ago

nyanko7/flux-dev-anime-cg

Text-to-Image • Updated Aug 17 • 20

liked a Space 4 months ago

Running on Zero

🦀

Flux1 Dev NF4

updated a collection 4 months ago

prompt-helper

Collection

3 items • Updated Aug 16

upvoted a paper 4 months ago

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

Paper • 2408.04594 • Published Aug 8 • 14

updated 2 datasets 5 months ago

szh/ai_images

Updated Aug 1 • 5

szh/ai_human_eval

Viewer • Updated Jul 17 • 11.8k • 10

updated a model 5 months ago

szh/mps-pth

Updated Jul 16

liked a model 6 months ago

ostris/vae-kl-f8-d16

Updated Jul 21 • 69 • 71

liked a dataset 6 months ago

CaptionEmporium/coyo-hd-11m-llavanext

Viewer • Updated Jul 6 • 11.4M • 291 • 24

upvoted a paper 6 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24 • 59