wen's picture

2 6

wen

zhengwenzhen

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

liked a Space 4 days ago

nanotron/ultrascale-playbook

upvoted an article 6 months ago

Recreating o1 at Home with Role-Play LLMs

View all activity

Organizations

zhengwenzhen's activity

upvoted a paper about 21 hours ago

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Paper • 2503.04715 • Published 8 days ago • 1

liked a Space 4 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 6 months ago

Article

Recreating o1 at Home with Role-Play LLMs

By

•

Sep 20, 2024

• 23

liked 2 datasets 7 months ago

TIGER-Lab/WebInstructSub

Viewer • Updated Oct 27, 2024 • 2.34M • 2.19k • 146

fka/awesome-chatgpt-prompts

Viewer • Updated Jan 6 • 203 • 12.4k • 7.62k

liked a model 12 months ago

wenbopan/Faro-Yi-9B

Text Generation • Updated Apr 23, 2024 • 3.63k • 16

liked a dataset 12 months ago

wenbopan/Fusang-v1

Viewer • Updated Mar 20, 2024 • 1.34M • 215 • 14

liked a model over 1 year ago

BAAI/bge-large-en-v1.5

Feature Extraction • Updated Feb 21, 2024 • 1.86M • • 495