flowaicom's Collections
Flow Judge v0.1 held-out test datasets
Updated Sep 14
This collection contains held-out splits for testing Flow-Judge-v0.1.
flowaicom/Flow-Judge-v0.1-binary-heldout • Updated Sep 18 • 316 • 45
flowaicom/Flow-Judge-v0.1-3-likert-heldout • Updated Sep 18 • 300 • 51
flowaicom/Flow-Judge-v0.1-5-likert-heldout • Updated Sep 18 • 274 • 76