hjDuan
4090with431
·
AI & ML interests
None yet
Organizations
None yet
4090with431's activity
Can't reproduce the evaluation result of GPQA dataset
5
#47 opened 3 months ago
by
Rinn000

Tried the "strawberry" demo case but got wrong answer😂
2
#51 opened 3 months ago
by
HuggingLianWang