ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

updated a model about 9 hours ago

BAAI/OpenSeek-Mid-v1

liked a model 3 days ago

BAAI/OpenSeek-Mid-v1

liked a model 13 days ago

deepseek-ai/DeepSeek-V4-Flash

View all activity

Organizations

upvoted a collection 22 days ago

Qwen3.6

4 items • Updated 16 days ago • 319

upvoted a paper 23 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 24 days ago • 90

upvoted a paper 27 days ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Paper • 2603.25562 • Published Mar 26 • 15

upvoted an article about 1 month ago

Article

Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️

Jan 4, 2025

•

9

upvoted a collection about 2 months ago

UltraData

Ultra Scale, Ultra Quality, Ultra Coverage • 10 items • Updated 21 days ago • 81

upvoted 2 papers about 2 months ago

Data Science and Technology Towards AGI Part I: Tiered Data Management

Paper • 2602.09003 • Published Feb 9 • 7

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 196

upvoted a collection about 2 months ago

Open Coding Agents Specialization

Ai2 Open Coding Agents - Django, Sphinx, Sympy Data • 6 items • Updated Feb 11 • 5

upvoted 3 papers about 2 months ago

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published Feb 28 • 64

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 98

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

upvoted a paper 2 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 340

upvoted an article 3 months ago

Article

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

Jun 27, 2025

•

26

upvoted 5 papers 4 months ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published Jan 22 • 19

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Paper • 2601.10124 • Published Jan 15 • 4

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published Apr 30, 2025 • 15

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 109

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 324

upvoted a collection 4 months ago

Molmo2 Data

Artifacts for the Molmo2 data release • 13 items • Updated Mar 2 • 40

upvoted a paper 5 months ago

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Paper • 2512.02551 • Published Dec 2, 2025 • 13