sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted_2 Viewer • Updated Dec 24, 2024 • 2.61k • 38
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_judged Viewer • Updated Dec 24, 2024 • 2.61k • 43
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted Viewer • Updated Dec 24, 2024 • 2.61k • 39
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers Viewer • Updated Dec 23, 2024 • 5.22k • 39
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2 Viewer • Updated Dec 21, 2024 • 2.61k • 38
sumukshashidhar-testing/yourbench_y1_singleshot_answers_reformatted Viewer • Updated Dec 21, 2024 • 3.49k • 37
sumukshashidhar-testing/yourbench_y1_single_shot_questions Viewer • Updated Dec 13, 2024 • 2.93k • 37
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models Paper • 2310.07611 • Published Oct 11, 2023 • 2