Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
heegyu
's Collections
Korean Reward Modeling
Korean Pretraining Dataset
RLHF papers
Reward Modeling Datasets
Pre-training Dataset
Vision LM
Image Generation
Domain Specific (Math, Code, etc)
Machine Translation
Safety LM
Text2SQL
Safety LM
updated
26 days ago
Upvote
-
meta-llama/LlamaGuard-7b
Text Generation
•
Updated
Apr 17
•
4.53k
•
205
meta-llama/Meta-Llama-Guard-2-8B
Text Generation
•
Updated
May 13
•
16.8k
•
273
OpenSafetyLab/MD-Judge-v0.1
Text Generation
•
Updated
May 20
•
1.53k
•
13
mcj311/saladbench_data
Viewer
•
Updated
Mar 28
•
30.4k
•
6
openbmb/UltraSafety
Viewer
•
Updated
Mar 16
•
3k
•
121
•
26
PKU-Alignment/BeaverTails
Viewer
•
Updated
Oct 17, 2023
•
364k
•
3.74k
•
30
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
25 days ago
•
164k
•
8.85k
•
109
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
35.5k
•
1.17k
lmsys/toxic-chat
Viewer
•
Updated
May 14
•
20.3k
•
5.67k
•
132
mmathys/openai-moderation-api-evaluation
Viewer
•
Updated
Aug 28, 2023
•
1.68k
•
135
•
18
allenai/WildChat-1M
Viewer
•
Updated
29 days ago
•
838k
•
727
•
269
allenai/wildjailbreak
Viewer
•
Updated
Aug 8
•
2.21k
•
1.24k
•
14
allenai/wildguardmix
Viewer
•
Updated
Jun 29
•
88.5k
•
11.6k
•
12
allenai/xstest-response
Viewer
•
Updated
Jun 29
•
895
•
4.41k
•
2
walledai/XSTest
Viewer
•
Updated
Jul 4
•
450
•
1.01k
•
3
meta-llama/Llama-Guard-3-8B
Text Generation
•
Updated
Aug 20
•
194k
•
108
Upvote
-
Share collection
View history
Collection guide
Browse collections