Yaowei Zheng

hiyouga

https://github.com/hiyouga

AI & ML interests

LLM Knowledge Management

hiyouga's activity

liked 2 models 1 day ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 2 days ago • 101k • 630

open-thoughts/OpenThinker2-32B

Text Generation • Updated 5 days ago • 280 • 36

New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 3 days ago

Llama 4 - open-source fine-tuning script

#27 opened 3 days ago by hiyouga

New activity in Qwen/Qwen2.5-Omni-7B 3 days ago

Open-source Fine-tuning script of Qwen2.5-Omni 7B 🚀

#29 opened 7 days ago by hiyouga

updated a model 3 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 3 days ago • 12

published a model 3 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 3 days ago • 12

liked a model 8 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 8 days ago • 105k • 1.26k

upvoted a paper 9 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 12 days ago • 43

liked a dataset 11 days ago

m-a-p/neo_sft_phase2

Viewer • Updated Jun 12, 2024 • 109k • 161 • 53

liked a model 14 days ago

manycore-research/SpatialLM-Llama-1B

Text Generation • Updated 19 days ago • 16k • 921

New activity in hiyouga/gsm8k 22 days ago

[bot] Conversion to Parquet

#1 opened 22 days ago by parquet-converter

updated a dataset 22 days ago

hiyouga/gsm8k

Viewer • Updated 22 days ago • 8.79k • 59

published a dataset 23 days ago

hiyouga/gsm8k

Viewer • Updated 22 days ago • 8.79k • 59

upvoted a paper 24 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 26 days ago • 27

liked a model 27 days ago

google/gemma-3-4b-it

Image-Text-to-Text • Updated 18 days ago • 546k • 411

upvoted an article 27 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 28 days ago • 378

upvoted a paper 28 days ago

EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network Operations

Paper • 2410.10315 • Published Oct 14, 2024 • 3

authored 2 papers 28 days ago

Regularizing Neural Networks via Adversarial Model Perturbation

Paper • 2010.04925 • Published Oct 10, 2020

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 57

liked a model 28 days ago

google/gemma-3-12b-pt

Image-Text-to-Text • Updated 18 days ago • 23.8k • 41