PolarisEvals/llm_dataset_completness_2stage_justification_score • Updated Jun 13, 2024 • 54.3k • 35
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug • Updated Jun 12, 2024 • 100 • 29
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response • Updated Jun 12, 2024 • 5.47k • 31
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest • Updated Jun 11, 2024 • 912 • 32
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug • Updated Jun 11, 2024 • 100 • 31
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts • Updated Jun 11, 2024 • 912 • 30
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug • Updated Jun 5, 2024 • 100 • 30
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions • Updated Jun 5, 2024 • 982 • 29
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug • Updated Jun 4, 2024 • 100 • 30
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug • Updated Jun 3, 2024 • 100 • 30
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 27
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 29
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 29