-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper โข 2402.17764 โข Published โข 612 -
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper โข 2402.17193 โข Published โข 25 -
Training-Free Long-Context Scaling of Large Language Models
Paper โข 2402.17463 โข Published โข 23 -
The Power of Scale for Parameter-Efficient Prompt Tuning
Paper โข 2104.08691 โข Published โข 10
Hao-Yuan Chen
MarkChenX
ยท
AI & ML interests
Deep Learning, Foundational Models, Domain Adaptation, Quantum AI, LLM Reasoning, Agentic Research
Recent Activity
upvoted
a
paper
10 days ago
Verbal Process Supervision Elicits Better Coding Agents
commented on
a paper
11 days ago
Verbal Process Supervision Elicits Better Coding Agents
authored
a paper
11 days ago
Verbal Process Supervision Elicits Better Coding Agents
Organizations
Collections
1
spaces
4
models
None public yet