Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12
10
19
Wei Xiong
weqweasdas
Follow
jinzhuoran's profile picture
GigaBoy's profile picture
circulartext's profile picture
15 followers
·
18 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
updated
a dataset
about 4 hours ago
dsrtrain/merged_turn1_turn2_warmup
published
a dataset
about 4 hours ago
dsrtrain/merged_turn1_turn2_warmup
updated
a dataset
about 4 hours ago
dsrtrain/raft_merged_turn1_turn2
View all activity
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
24 days ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
Updated
Nov 9, 2024
•
15.8k
•
32
liked
a dataset
3 months ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
Viewer
•
Updated
Nov 2, 2024
•
2.32M
•
82
•
5
liked
a model
3 months ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data
Text Generation
•
Updated
Nov 9, 2024
•
796
•
8
liked
2 models
6 months ago
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
Updated
Sep 6, 2024
•
4.75k
•
22
RLHFlow/LLaMA3-SFT
Text Generation
•
Updated
Nov 3, 2024
•
6.9k
•
10
liked
a model
8 months ago
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
Updated
Oct 14, 2024
•
6.53k
•
40
liked
5 models
9 months ago
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
Sep 23, 2024
•
52.1k
•
167
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
Updated
Oct 14, 2024
•
1.81k
•
38
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
Updated
19 days ago
•
17
•
11
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
Updated
19 days ago
•
31
•
8
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
Updated
19 days ago
•
174
•
77
liked
2 models
10 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
Oct 14, 2024
•
5.46k
•
54
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
Updated
Apr 24, 2024
•
117
•
8
liked
a model
11 months ago
weqweasdas/RM-Mistral-7B
Text Classification
•
Updated
Mar 31, 2024
•
1.02k
•
22
liked
a Space
11 months ago
Running
326
326
Reward Bench Leaderboard
📐
Explore and analyze RewardBench leaderboard data
liked
2 models
12 months ago
weqweasdas/RM-Gemma-7B
Text Classification
•
Updated
Mar 22, 2024
•
55
•
8
weqweasdas/RM-Gemma-2B
Text Classification
•
Updated
Mar 22, 2024
•
7.01k
•
22
liked
a model
over 1 year ago
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25, 2024
•
980
•
17
liked
a Space
almost 2 years ago
Runtime error
66
66
Robin 7b
🔥