4 32 218

PeijieDong

pprp

https://pprp.github.io

AI & ML interests

Model Compression; Large Language Model;

Recent Activity

liked a model about 14 hours ago

deepseek-ai/DeepSeek-V4-Pro

liked a model 8 days ago

Qwen/Qwen3.6-35B-A3B

upvoted a paper about 2 months ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

View all activity

Organizations

None yet

liked a model about 14 hours ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated about 8 hours ago • 30 • 2.29k

liked a model 8 days ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated about 15 hours ago • 861k • 1.38k

liked a dataset 4 months ago

Gen-Verse/Open-AgentRL-30K

Viewer • Updated Oct 14, 2025 • 30.1k • 185 • 8

liked a model 4 months ago

nvidia/Nemotron-Flash-1B

Text Generation • 1.0B • Updated Jan 9 • 15.5k • 28

liked 2 datasets 4 months ago

Idavidrein/gpqa

Benchmark • Updated Mar 5 • 1.25k • 102k • 419

nvidia/Llama-Nemotron-VLM-Dataset-v1

Viewer • Updated Oct 22, 2025 • 2.86M • 1.05k • 159

liked 2 datasets 5 months ago

allenai/olmo-mix-1124

Viewer • Updated Aug 19, 2025 • 621M • 27.9k • 87

OptimalScale/ClimbLab

Viewer • Updated May 4, 2025 • 1.24B • 8.46k • 13

liked a Space 6 months ago

The Smol Training Playbook

📚

3.12k

The secrets to building world-class LLMs

liked a model 6 months ago

inclusionAI/Ring-flash-linear-2.0

Text Generation • 104B • Updated Oct 23, 2025 • 40 • 99

liked a dataset 6 months ago

InternSVG/SArena

Viewer • Updated Feb 3 • 14k • 201 • 8

liked 2 models 6 months ago

inclusionAI/Ring-lite-linear-preview

Text Generation • 17B • Updated Aug 18, 2025 • 37 • 39

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 1.17k • 365

liked a model 7 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • Updated Nov 18, 2025 • 185k • • 990

liked a model 8 months ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • Updated Nov 21, 2025 • 2.4k • 241

liked a dataset 8 months ago

nvidia/AceReason-1.1-SFT

Viewer • Updated Jun 18, 2025 • 3.96M • 5.29k • 99

liked a dataset 9 months ago

nvidia/AceReason-Math

Viewer • Updated Jun 18, 2025 • 49.6k • 1.21k • 53

liked a Space 9 months ago

GPT-OSS-120B on AMD MI300X

💻

334

gpt-oss-120b on AMD MI300X GPUs

liked 2 models 9 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.67M • • 4.73k

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated Jan 9 • 9.27k • 1.2k