wang

wangxbx

AI & ML interests

None yet

Organizations

None yet

wangxbx's activity

upvoted 6 papers 10 days ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published 16 days ago • 48

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 16 days ago • 46

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published 18 days ago • 44

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Paper • 2503.16257 • Published 16 days ago • 23

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published 16 days ago • 65

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published 16 days ago • 82

upvoted 4 papers 14 days ago

XAttention: Block Sparse Attention with Antidiagonal Scoring

Paper • 2503.16428 • Published 16 days ago • 12

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published 16 days ago • 14

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 18 days ago • 112

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 18 days ago • 134

upvoted 2 papers 23 days ago

EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test

Paper • 2503.01840 • Published Mar 3 • 4

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 26 days ago • 34

upvoted 2 papers 27 days ago

Identifying Sensitive Weights via Post-quantization Integral

Paper • 2503.01901 • Published Feb 28 • 7

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 30 days ago • 103

liked a model about 1 month ago

Qwen/QwQ-32B

Text Generation • Updated 25 days ago • 852k • 2.63k

upvoted 5 papers about 1 month ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2 • 11

Speculative Ad-hoc Querying

Paper • 2503.00714 • Published Mar 2 • 12

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 81

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published Feb 20 • 7

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4 • 14