Zengzhi Wang

SinclairWang

https://tinyurl.com/zengzhi-homepage

AI & ML interests

Data Engineering for Generative AI

Recent Activity

liked a dataset 10 days ago

GAIR/daVinci-Dev

upvoted a paper about 1 month ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

upvoted a paper about 2 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

upvoted a paper about 1 month ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 65

upvoted 2 papers about 2 months ago

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 83

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Paper • 2512.16912 • Published Dec 18, 2025 • 12

upvoted a paper 2 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 97

upvoted 4 papers 3 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Context Engineering 2.0: The Context of Context Engineering

Paper • 2510.26493 • Published Oct 30, 2025 • 8

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 91

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 16

upvoted a paper 4 months ago

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published Oct 20, 2025 • 75

upvoted an article 4 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 753

upvoted a paper 4 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

upvoted a paper 6 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 144

upvoted 4 collections 6 months ago

ProX General Models

base models trained on ProX curated data. • 16 items • Updated Oct 10, 2024 • 1

ProX Math Models

base models trained on ProX curated openwebmath-pro. • 5 items • Updated Oct 10, 2024 • 1

ProX Refining Models

Adapted small language models used to generate data refining programs • 5 items • Updated Oct 10, 2024 • 5

Qwen3

84 items • Updated Dec 31, 2025 • 1.63k

upvoted 3 papers 7 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22, 2025 • 63

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12, 2025 • 37

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1, 2025 • 10

upvoted a collection 7 months ago

OctoThinker-Llama-1B Family

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors then leverage them to scale up mid-training. • 6 items • Updated Jul 6, 2025 • 2