Tongyao PRO

tyzhu

tongyao-zhu

AI & ML interests

Natural Language Processing

Recent Activity

updated a model 7 days ago

tyzhu/litgpt_pretrain_arxiv_jan26

published a model 8 days ago

tyzhu/litgpt_pretrain_arxiv_jan26

updated a model 10 days ago

tyzhu/proweb-checkpoints

Organizations

None yet

Collections 8

Papers 6

models 18

datasets 114

tyzhu/mathpros1

Updated 13 days ago • 107

tyzhu/mathpros10

Viewer • Updated 13 days ago • 1.05M • 86

tyzhu/megamathpromax-shuffled

Updated 19 days ago • 85

tyzhu/megamath-web-pro-max-splitted

Viewer • Updated 25 days ago • 48.3M • 51

tyzhu/mathpro_train

Viewer • Updated Nov 26, 2025 • 1 • 7

tyzhu/opcd-10percent

Viewer • Updated Nov 11, 2025 • 49M • 2

tyzhu/opcder-batch

Viewer • Updated Nov 9, 2025 • 472M • 110

tyzhu/SPA-sokoban-data

Viewer • Updated Oct 29, 2025 • 6.01k • 51

tyzhu/SPA-frozenlake-data

Viewer • Updated Oct 29, 2025 • 4.22k • 10

tyzhu/SPA-sudoku-data

Viewer • Updated Oct 29, 2025 • 7.54k • 15

Collections 8

Law of Vision Representation in MLLMs

LongRoPE2: Near-Lossless LLM Context Window Scaling

models 18

tyzhu/litgpt_pretrain_arxiv_jan26

tyzhu/proweb-checkpoints

tyzhu/opencoder484

tyzhu/opencoder-1.5b-pystack80-opcanneal20-50ksteps

tyzhu/sokoban-1.5b-coord-baseline-rl1000

tyzhu/opencoder-1.5b-oppt80-opcanneal20-25ksteps-4nodes-4k

tyzhu/olmo-1b-finecode-5ksteps

tyzhu/webinsv1clear-grpo-qwen3-4b

tyzhu/SPA-frozenlake-qwen2.5-1.5b-instruct

tyzhu/SPA-sudoku-qwen2.5-1.5b-instruct
