Bingzheng Wei

Bingzheng

AI & ML interests

None yet

Organizations

None yet

Bingzheng's activity

upvoted 3 papers 1 day ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published 9 days ago • 20

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 4 days ago • 44

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 3 days ago • 51

upvoted 5 papers 3 days ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published 5 days ago • 26

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published 8 days ago • 50

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 5 days ago • 77

Heimdall: test-time scaling on the generative verification

Paper • 2504.10337 • Published 5 days ago • 29

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 4 days ago • 36

upvoted 3 papers 4 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 5 days ago • 223

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 12 days ago • 112

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published 9 days ago • 39

upvoted 4 papers 5 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 8 days ago • 44

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 8 days ago • 117

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published 9 days ago • 58

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 17 days ago • 79

upvoted a paper 8 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 9 days ago • 113

upvoted 4 papers 9 days ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 10 days ago • 69

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published 11 days ago • 79

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published 11 days ago • 101

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 11 days ago • 143