kang

qiyue

AI & ML interests

None yet

Organizations

None yet

qiyue's activity

liked a model about 2 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 17 days ago • 1.22M • 3.33k

upvoted an article 3 months ago

Hugging Face welcomes the Aya Expanse family of multilingual models

Article • Oct 24, 2024 • 10

upvoted a paper 5 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

liked a model 5 months ago

mistralai/Mistral-Small-Instruct-2409

Updated Oct 16, 2024 • 43.5k • 379

upvoted an article 6 months ago

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Article • Aug 21, 2024 • 29

upvoted a paper 7 months ago

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18, 2024 • 17

upvoted 2 articles 7 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Article • Jul 11, 2024 • 11

The Rise of Agentic Data Generation

Article • Jul 15, 2024 • 81

liked a dataset 8 months ago

tasksource/tasksource_dpo_pairs

Viewer • Updated Jul 1, 2024 • 5.13M • 315 • 21

upvoted an article 8 months ago

Putting RL back in RLHF

Article • Jun 12, 2024 • 75

liked 3 datasets 9 months ago

liked 2 models 10 months ago

mlabonne/OrpoLlama-3-8B

Text Generation • Updated Jun 15, 2024 • 30 • 53

NousResearch/Meta-Llama-3-8B

Text Generation • Updated Apr 30, 2024 • 25.6k • 96

liked a dataset 10 months ago

data-is-better-together/10k_prompts_ranked

Viewer • Updated Mar 7, 2024 • 10.3k • 557 • 146

liked a dataset 11 months ago

OpenLeecher/Teatime

Updated Jul 9, 2023 • 253 • 34

liked 2 datasets about 1 year ago

openbmb/UltraFeedback

Viewer • Updated Dec 29, 2023 • 64k • 2.3k • 347

argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 467 • 70

liked a model about 1 year ago

xDAN-AI/xDAN-L1-Chat-RL-v1

Text Generation • Updated Dec 29, 2023 • 1.59k • 63