Zikai Zhou

Klayand

https://klayand.github.io/

AI & ML interests

Knowledge Distillation, Generative Models

Organizations

None yet

Recent Activity

upvoted a paper about 24 hours ago

Qwen-Image-2.0-RL Technical Report

Paper • 2606.27608 • Published 5 days ago • 33

upvoted a paper 4 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 5 days ago • 46

upvoted a paper 5 days ago

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Paper • 2606.25041 • Published 7 days ago • 103

upvoted 3 papers 14 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 20 days ago • 206

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

Paper • 2606.16255 • Published 15 days ago • 14

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published 15 days ago • 32

upvoted a paper 19 days ago

Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions

Paper • 2606.09076 • Published 22 days ago • 63

upvoted 2 papers 20 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 21 days ago • 41

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 21 days ago • 192

upvoted 2 papers 26 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 29 days ago • 136

Qwen-Image-Flash: Beyond Objective Design

Paper • 2606.03746 • Published 28 days ago • 36

upvoted 2 papers 29 days ago

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published May 28 • 41

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published May 29 • 63

upvoted 4 papers about 1 month ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

upvoted 3 papers about 2 months ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published May 13 • 62

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 115
