siyeng feng

siyengfeng

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct

liked a model about 19 hours ago

all-hands/openhands-lm-32b-v0.1

liked a model about 19 hours ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

View all activity

Organizations

None yet

siyengfeng's activity

upvoted 5 papers about 19 hours ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published 8 days ago • 7

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Paper • 2504.07866 • Published 5 days ago • 7

ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance

Paper • 2504.08716 • Published 4 days ago • 7

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published 4 days ago • 18

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 4 days ago • 98

upvoted 15 papers 1 day ago

Towards Visual Text Grounding of Multimodal Large Language Model

Paper • 2504.04974 • Published 8 days ago • 11

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published 5 days ago • 14

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 5 days ago • 20

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published 5 days ago • 58

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 14 days ago • 72

Kimi-VL Technical Report

Paper • 2504.07491 • Published 5 days ago • 108

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published 6 days ago • 9

Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling

Paper • 2504.05410 • Published 8 days ago • 2

Self-Steering Language Models

Paper • 2504.07081 • Published 6 days ago • 15

A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Paper • 2504.07086 • Published 6 days ago • 17

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 6 days ago • 66

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published 7 days ago • 32

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Paper • 2504.06122 • Published 7 days ago • 5

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published 16 days ago • 8

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Paper • 2504.05520 • Published 8 days ago • 8