larry

szh

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 28 days ago • 159

upvoted a paper about 2 months ago

LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

Paper • 2604.15311 • Published Apr 16 • 13

upvoted 3 papers 6 months ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published Dec 12, 2025 • 39

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 51

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 246

upvoted 2 papers 7 months ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 32

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25, 2025 • 38

upvoted a paper 8 months ago

Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

Paper • 2510.21583 • Published Oct 24, 2025 • 31

upvoted a paper 9 months ago

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Paper • 2509.16117 • Published Sep 19, 2025 • 23

upvoted a paper 12 months ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 109

upvoted a paper about 1 year ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

upvoted 2 papers over 1 year ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 107

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4, 2025 • 66

upvoted 2 collections over 1 year ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Mar 12 • 153

Sana

Collection

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 22 items • Updated Mar 10 • 105

upvoted 3 papers over 1 year ago

upvoted 2 papers almost 2 years ago

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

Paper • 2408.04594 • Published Aug 8, 2024 • 14

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 63
