3 226 334

zhangwenbin

ExceedZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Qwen3-ASR Technical Report

upvoted an article 2 days ago

We Got Claude to Build CUDA Kernels and teach open models!

liked a model 4 days ago

thebajajra/RexReranker-0.6B

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published 3 days ago • 21

upvoted an article 2 days ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

4 days ago

•

102

upvoted a paper 6 days ago

Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17, 2025 • 35

upvoted a paper 8 days ago

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published 10 days ago • 56

upvoted 3 papers 13 days ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 17 days ago • 189

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

Paper • 2601.06953 • Published 21 days ago • 44

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published 19 days ago • 51

upvoted 2 articles 16 days ago

Article

Introducing OptiMind, a research model designed for optimization

16 days ago

•

Article

Open Responses: What you need to know

17 days ago

•

101

upvoted a paper 21 days ago

How to Correctly Report LLM-as-a-Judge Evaluations

Paper • 2511.21140 • Published Nov 26, 2025 • 1

upvoted an article 28 days ago

Article

New in llama.cpp: Model Management

Dec 11, 2025

•

116

upvoted 2 papers about 1 month ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 108

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 36

upvoted 2 articles about 1 month ago

Article

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

Dec 23, 2025

•

Article

使用 NVIDIA Isaac 构建医疗机器人：从仿真到部署

Oct 29, 2025

•

upvoted a collection about 1 month ago

sam-audio

Collection

11 items • Updated Dec 16, 2025 • 123

upvoted an article about 1 month ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

119

upvoted an article about 2 months ago

Article

Codex is Open Sourcing AI models

Dec 11, 2025

•

upvoted a paper about 2 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 152

upvoted an article about 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

586

zhangwenbin

AI & ML interests

Recent Activity

Organizations

ExceedZhang's activity

We Got Claude to Build CUDA Kernels and teach open models!

Introducing OptiMind, a research model designed for optimization

Open Responses: What you need to know

New in llama.cpp: Model Management

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

使用 NVIDIA Isaac 构建医疗机器人：从仿真到部署

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Codex is Open Sourcing AI models

We Got Claude to Fine-Tune an Open Source LLM