1 1 14

wang

zhaokai

gklab

AI & ML interests

None yet

Recent Activity

liked a dataset 17 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

liked a Space 22 days ago

nanotron/ultrascale-playbook

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

View all activity

Organizations

zhaokai's activity

liked a dataset 17 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated 21 days ago • 110k • 7.74k • 522

liked a Space 22 days ago

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 18 days ago • 1.59M • • 1.26k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.75M • • 11.3k

liked a model 3 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 18 days ago • 762k • 1.59k

upvoted a collection 6 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208

liked 2 models 7 months ago

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated 6 days ago • 41k • • 555

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12 • 146k • • 369

liked a model 8 months ago

meta-llama/Prompt-Guard-86M

Text Classification • Updated Jul 25, 2024 • 28k • • 233

liked a model 9 months ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 25k • 1.39k

liked a Space 9 months ago

869

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a dataset about 1 year ago

Skywork/SkyPile-150B

Viewer • Updated Dec 7, 2023 • 1.76M • 3.75k • 364

New activity in SkunkworksAI/phi-2 over 1 year ago

Update config.json

#7 opened over 1 year ago by

zhaokai

liked 3 models over 1 year ago