Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
5
Joshua Vendrow
jvendrow
Follow
evendrow's profile picture
timmhaucke's profile picture
2 followers
·
8 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Do Large Language Model Benchmarks Test Reliability?
liked
a dataset
about 1 month ago
madrylab/gsm8k-platinum
new
activity
about 1 month ago
madrylab/platinum-bench:
Grammatical error in squad task 5ad2b72fd7d075001a42a022
View all activity
Organizations
jvendrow
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
paper
about 1 month ago
Do Large Language Model Benchmarks Test Reliability?
Paper
•
2502.03461
•
Published
Feb 5
•
3
liked
a dataset
about 1 month ago
madrylab/gsm8k-platinum
Viewer
•
Updated
Mar 11
•
1.21k
•
6.43k
•
35
New activity in
madrylab/platinum-bench
about 1 month ago
Grammatical error in squad task 5ad2b72fd7d075001a42a022
1
#1 opened 2 months ago by
masharpe
liked
2 datasets
2 months ago
madrylab/platinum-bench
Viewer
•
Updated
about 18 hours ago
•
3.3k
•
731
•
21
madrylab/platinum-bench-paper-version
Viewer
•
Updated
Feb 6
•
3.3k
•
113
•
2
updated
a dataset
2 months ago
madrylab/platinum-bench-paper-cache
Updated
Feb 6
•
115
•
1
liked
a dataset
5 months ago
evendrow/INQUIRE-Rerank
Viewer
•
Updated
Dec 14, 2024
•
20k
•
186
•
10
liked
a Space
about 2 years ago
Runtime error
37
37
Photoguard
🛡