Support comparing model tree generations 523fad9 verified albertvillanova HF staff commited on Nov 5, 2024
Fix load_results for None data frames 96f60e1 verified albertvillanova HF staff commited on Oct 31, 2024
Set result_paths_per_model as State 729af67 verified albertvillanova HF staff commited on Oct 31, 2024
Improve async loading performance of Details 4a05739 verified albertvillanova HF staff commited on Oct 29, 2024
Hide MATH fewshot_config.samples with memory address 22fb9eb verified albertvillanova HF staff commited on Oct 28, 2024
Refactor glob to use the cache of HfFileSystem 7e32ac7 verified albertvillanova HF staff commited on Oct 18, 2024
Align Details samples sorting by doc_id 6411b1c verified albertvillanova HF staff commited on Oct 17, 2024
Support .json and .jsonl details files 148216f verified albertvillanova HF staff commited on Oct 17, 2024
Add explanation that login is required for GPQA Details e970061 verified albertvillanova HF staff commited on Oct 17, 2024
Use color map for Results metrics values 581682a verified albertvillanova HF staff commited on Oct 17, 2024
Add checkbox in Details to show only differences 6cf57e4 verified albertvillanova HF staff commited on Oct 16, 2024
Add checkbox in Configs to show only differences f12aa56 verified albertvillanova HF staff commited on Oct 16, 2024
Add checkbox in Results to hide stderr 54e105e verified albertvillanova HF staff commited on Oct 16, 2024
Fix loading Details with documents containing end of lines 662ed4b verified albertvillanova HF staff commited on Oct 16, 2024
Add additional info to task description 651545d verified albertvillanova HF staff commited on Oct 15, 2024
Remove ARC task by hiding from All 6099782 verified albertvillanova HF staff commited on Oct 14, 2024
Highlight exact_match and change colors a4b20f4 verified albertvillanova HF staff commited on Oct 11, 2024
Highlight min/max Results accuracy 26e855f verified albertvillanova HF staff commited on Oct 11, 2024