Shizhe Diao

shizhediao2

AI & ML interests

LLM pre-training and reasoning

Recent Activity

updated a model 2 days ago
data4elm/Llama-400M-12L
published a model 2 days ago
data4elm/Llama-400M-12L
View all activity

Organizations

NVIDIA's profile picture temp_math_data's profile picture UGPhysics's profile picture Data Filtering Challenge for Training Edge Language Models's profile picture

shizhediao2's activity

New activity in nvidia/ClimbMix 1 day ago
New activity in nvidia/ClimbMix 3 days ago

Tokenizer used

1
#5 opened 4 days ago by
gvlassis
New activity in nvidia/ClimbMix 4 days ago
New activity in nvidia/ClimbLab 5 days ago
New activity in nvidia/ClimbMix 6 days ago

Create README.md

#1 opened 6 days ago by
shizhediao2
New activity in nvidia/ClimbLab 6 days ago

Create README.md

#1 opened 6 days ago by
shizhediao2
New activity in LLM360/TxT360 5 months ago

Is the full data ready?

3
#6 opened 6 months ago by
shizhediao2
New activity in ZenMoore/RoleBench 8 months ago

multiturn roleplay

2
#2 opened over 1 year ago by
520jefferson