duweining's picture

25 4

duweining

duringwei

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

upvoted a paper about 2 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

liked a model about 2 months ago

protectai/test-public-repo

View all activity

Organizations

None yet

duringwei's activity

upvoted a paper about 1 month ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 66

upvoted a paper about 2 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13 • 193

liked 4 models about 2 months ago

protectai/test-public-repo

Updated 8 minutes ago • 39.6k • 1

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated Feb 13 • 2.01M • 1.03k

microsoft/OmniParser-v2.0

Updated 19 days ago • 2.57k • 1.22k

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 242k • 3.33k

upvoted 14 papers about 2 months ago

ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning

Paper • 2410.17779 • Published Oct 23, 2024 • 9

Value Residual Learning For Alleviating Attention Concentration In Transformers

Paper • 2410.17897 • Published Oct 23, 2024 • 9

Language Models are Symbolic Learners in Arithmetic

Paper • 2410.15580 • Published Oct 21, 2024 • 8

The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI

Paper • 2410.18441 • Published Oct 24, 2024 • 7

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Paper • 2410.18252 • Published Oct 23, 2024 • 7

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

Paper • 2410.18785 • Published Oct 24, 2024 • 7

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

Paper • 2410.18194 • Published Oct 23, 2024 • 6

Data Scaling Laws in Imitation Learning for Robotic Manipulation

Paper • 2410.18647 • Published Oct 24, 2024 • 6

Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4

Paper • 2410.16429 • Published Oct 21, 2024 • 5

Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits

Paper • 2410.18234 • Published Oct 23, 2024 • 5

Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance

Paper • 2410.13816 • Published Oct 17, 2024 • 2

ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding

Paper • 2410.13924 • Published Oct 17, 2024 • 7

TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts

Paper • 2410.18071 • Published Oct 23, 2024 • 7

LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias

Paper • 2410.17242 • Published Oct 22, 2024 • 5