Commit History

Fix pydantic gradio compatibility error
3c28d4f

mirageco commited on

Add FinTrade SR dataset
cfd9447

mirageco commited on

Display results on the dashboard even if the result is missing by filling in "missings" into the column
ee62fba

mirageco commited on

Add small descriptions of each of the datasets
e9d718d

mirageco commited on

Fix Average_RM
46bbf3e

mirageco commited on

Add average checkbox for each individual category
91f5d94

mirageco commited on

Remove unused debug print. Add addition check for request files
da9cc09

mirageco commited on

Update src/leaderboard/read_evals.py
f2550e7
verified

Me1oy commited on

Update src/leaderboard/read_evals.py
5e023a2
verified

Me1oy commited on

Update src/leaderboard/read_evals.py
f91664b
verified

Me1oy commited on

Update src/leaderboard/read_evals.py
a5c393f
verified

Me1oy commited on

doc
c1b8a96

Clémentine commited on

simplified the template
24622c4

Clémentine commited on

now with a functionning backend
1ffc326

Clémentine commited on

update read
943f952

Clémentine commited on

fixs
314f91a

Clémentine commited on

Simplified leaderboard v0
9833cdb

Clémentine commited on

simplified some parts of the code + updated requirements
9d22eee

Clémentine commited on

add model architecture as column
3dfaf22

Clémentine commited on

Refactor 2 - added plotting back
b1a1395

Clémentine commited on

Fix requirements for mistral models - to change once transformers gets updated.
002172c

Clémentine commited on

fix col width
fc1e99b

Clémentine commited on

refacto style + rate limit
df66f6e

Clémentine commited on

Fix TruthfulQA NaN scores to 0
bb17be3

Clémentine commited on

refacto part 1
2a5f9fb

Clémentine commited on