peng

superpeng

AI & ML interests

None yet

Organizations

None yet

superpeng's activity

liked a dataset 13 days ago

Flmc/DISC-Med-SFT

Viewer • Updated Aug 29, 2023 • 465k • 235 • 90

liked a dataset 15 days ago

simplescaling/s1K-1.1

Viewer • Updated Feb 27 • 1k • 6.04k • 103

liked a dataset 16 days ago

GAIR/LIMO

Viewer • Updated Feb 10 • 817 • 4.9k • 147

liked a dataset 26 days ago

CharlieDreemur/OpenManus-RL

Viewer • Updated 22 days ago • 48.9k • 2.2k • 44

upvoted a paper about 1 month ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 72

liked a dataset about 1 month ago

open-thoughts/OpenThoughts-114k

Viewer • Updated Feb 20 • 228k • 25.5k • 680

upvoted a collection about 1 month ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated Mar 3 • 112

liked a model about 1 month ago

microsoft/Phi-4-mini-instruct

Text Generation • Updated 26 days ago • 345k • 420

liked 2 datasets about 1 month ago

FreedomIntelligence/Medical-R1-Distill-Data

Viewer • Updated Feb 22 • 22k • 1.44k • 34

jdh-algo/Citrus_S3

Preview • Updated Feb 27 • 467 • 8

liked a model about 1 month ago

baichuan-inc/Baichuan-M1-14B-Instruct

Updated Feb 20 • 26.6k • 52

liked 2 datasets about 1 month ago

FreedomIntelligence/medical-o1-verifiable-problem

Viewer • Updated Dec 30, 2024 • 40.6k • 1.15k • 84

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19 • 110k • 3.78k • 159

liked a dataset about 2 months ago

SPIRAL-MED/o1-journey-Ophiuchus

Viewer • Updated Jan 15 • 5.31k • 34 • 11

upvoted a collection about 2 months ago

DeepSeek-R1-ReDistill

Collection

Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30 • 14

liked 2 datasets about 2 months ago

mlfoundations-dev/filtered_numina_R1

Viewer • Updated Jan 23 • 34.3k • 86 • 6

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated Feb 8 • 1.85M • 2.94k • 293

liked 3 datasets 3 months ago