PolarisEvals/llm_dataset_completness_2stage_justification_score Viewer • Updated Jun 13, 2024 • 54.3k • 46
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug Viewer • Updated Jun 12, 2024 • 100 • 21
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response Viewer • Updated Jun 12, 2024 • 5.47k • 35
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest Viewer • Updated Jun 11, 2024 • 912 • 28
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug Viewer • Updated Jun 11, 2024 • 100 • 38
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts Viewer • Updated Jun 11, 2024 • 912 • 33
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug Viewer • Updated Jun 5, 2024 • 100 • 17
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions Viewer • Updated Jun 5, 2024 • 982 • 18
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug Viewer • Updated Jun 4, 2024 • 100 • 17
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug Viewer • Updated Jun 3, 2024 • 100 • 18
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 22
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 22
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug Viewer • Updated May 30, 2024 • 100 • 16