Kaiwen Wang

kaiwenw
·
https://kaiwenw.github.io/
  • kaiwenw_ai
  • kaiwenw

AI & ML interests

Reinforcement Learning

Recent Activity

new activity 17 days ago
VGS-AI/OpenR1-Cleaned:Add link to GitHub repository and example usage
new activity 18 days ago
VGS-AI/OpenR1-VM:Add link to codebase
new activity 18 days ago
VGS-AI/DeepSeek-VM-1.5B:Improve model card for Value-Guided Search

Organizations

VGS

kaiwenw's datasets (220)

kaiwenw/sep19_eft_gpt4o

Viewer • Updated Oct 31, 2024 • 6.28k • 37 • 1

kaiwenw/oct30_oasst_gpt4o_jft_strict

Viewer • Updated Oct 31, 2024 • 3.87k • 29

kaiwenw/oct30_oasst_gpt4o_jft

Viewer • Updated Oct 31, 2024 • 6.7k • 26

kaiwenw/oct30_oasst_llama70b_jft_strict

Viewer • Updated Oct 31, 2024 • 3.69k • 29

kaiwenw/oct30_oasst_llama70b_jft

Viewer • Updated Oct 31, 2024 • 6.25k • 29

kaiwenw/oct28_selfplay_jft_strict

Viewer • Updated Oct 29, 2024 • 1.22k • 33

kaiwenw/oct28_selfplay_jft

Viewer • Updated Oct 29, 2024 • 6.73k • 29

kaiwenw/oct28_selfplay_try2

Viewer • Updated Oct 28, 2024 • 3.64k • 31

kaiwenw/oasst

Viewer • Updated Oct 27, 2024 • 3.64k • 29

kaiwenw/ultrafeedback-gemma2-9b-it-SimPO-vllm

Viewer • Updated Oct 24, 2024 • 61.5k • 29