weishen's picture

8 8 32

weishen

fakerbaby

·

fakerbaby

AI & ML interests

NLP, alignment, LLM

Recent Activity

liked a Space about 17 hours ago

opencompass/open_vlm_leaderboard

liked a dataset 2 days ago

Rapidata/700k_Human_Preference_Dataset_FLUX_SD3_MJ_DALLE3

upvoted an article 2 days ago

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

View all activity

Organizations

fakerbaby's activity

liked a Space about 17 hours ago

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

liked a dataset 2 days ago

Rapidata/700k_Human_Preference_Dataset_FLUX_SD3_MJ_DALLE3

Viewer • Updated Jan 10 • 25.6k • 187 • 18

upvoted an article 2 days ago

Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Dec 9, 2024

• 59

liked a Space 9 days ago

DeepSite

Generate any application with DeepSeek

liked a Space 10 days ago

EasyControl Ghibli

New Ghibli EasyControl model is now released!!

liked a dataset about 2 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 4.24k • 622

liked a model 4 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 224k • • 1.73k

liked a dataset 4 months ago

HPAI-BSC/Aloe-Beta-Medical-Collection

Viewer • Updated 10 days ago • 102k • 51 • 3

upvoted a collection 4 months ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 35

liked 4 datasets 5 months ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1, 2024 • 467k • 161 • 32

nvidia/OpenMathInstruct-2

Viewer • Updated Nov 25, 2024 • 22M • 7.39k • 165

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 2.37k • 93

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25, 2024 • 77k • 803 • 47

liked a dataset 6 months ago

AI-MO/aimo-validation-aime

Viewer • Updated Jul 10, 2024 • 90 • 9.66k • 42

reacted to onekq's post with 👍 7 months ago

Post

2574

Here is my latest study on OpenAI🍓o1🍓.
A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)

I wrote an easy-to-read blogpost to explain finding.
https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-models

INSTRUCTION FOLLOWING is the key.

100% instruction following + Reasoning = new SOTA

But if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models.

upvoted a collection 7 months ago

Infinity Instruct

16 items • Updated Mar 9 • 9

liked 3 datasets 7 months ago

Magpie-Align/MagpieLM-SFT-Data-v0.1

Viewer • Updated Dec 9, 2024 • 550k • 41 • 15

MARIO-Math-Reasoning/Gaokao2023-Math-En

Viewer • Updated Jun 1, 2024 • 385 • 57 • 6

hfl/stem_zh_instruction

Viewer • Updated May 13, 2024 • 256k • 434 • 29

liked a Space 7 months ago

Qwen2.5

Chat with Qwen, a helpful assistant