zhuzeyuan's Collections
"Physics of Language Models" series
Physics of Language Models: Part 1, Context-Free Grammar
Paper • 2305.13673 • Published • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation
Paper • 2309.14402 • Published • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Paper • 2404.05405 • Published • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Paper • 2309.14316 • Published • 8
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process
Paper • 2407.20311 • Published • 5
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems
Paper • 2408.16293 • Published • 25