all-MiniLM-L6-v2 trained on MEDI-MTEB triplets
This is a sentence-transformers model finetuned from sentence-transformers/all-MiniLM-L6-v2 on the NQ, pubmed, specter_train_triples, S2ORC_citations_abstracts, fever, gooaq_pairs, codesearchnet, wikihow, WikiAnswers, eli5_question_answer, amazon-qa, medmcqa, zeroshot, TriviaQA_pairs, PAQ_pairs, stackexchange_duplicate_questions_title-body_title-body, trex, flickr30k_captions, hotpotqa, task671_ambigqa_text_generation, task061_ropes_answer_generation, task285_imdb_answer_generation, task905_hate_speech_offensive_classification, task566_circa_classification, task184_snli_entailment_to_neutral_text_modification, task280_stereoset_classification_stereotype_type, task1599_smcalflow_classification, task1384_deal_or_no_dialog_classification, task591_sciq_answer_generation, task823_peixian-rtgender_sentiment_analysis, task023_cosmosqa_question_generation, task900_freebase_qa_category_classification, task924_event2mind_word_generation, task152_tomqa_find_location_easy_noise, task1368_healthfact_sentence_generation, task1661_super_glue_classification, task1187_politifact_classification, task1728_web_nlg_data_to_text, task112_asset_simple_sentence_identification, task1340_msr_text_compression_compression, task072_abductivenli_answer_generation, task1504_hatexplain_answer_generation, task684_online_privacy_policy_text_information_type_generation, task1290_xsum_summarization, task075_squad1.1_answer_generation, task1587_scifact_classification, task384_socialiqa_question_classification, task1555_scitail_answer_generation, task1532_daily_dialog_emotion_classification, task239_tweetqa_answer_generation, task596_mocha_question_generation, task1411_dart_subject_identification, task1359_numer_sense_answer_generation, task329_gap_classification, task220_rocstories_title_classification, task316_crows-pairs_classification_stereotype, task495_semeval_headline_classification, task1168_brown_coarse_pos_tagging, task348_squad2.0_unanswerable_question_generation, task049_multirc_questions_needed_to_answer, task1534_daily_dialog_question_classification, task322_jigsaw_classification_threat, task295_semeval_2020_task4_commonsense_reasoning, task186_snli_contradiction_to_entailment_text_modification, task034_winogrande_question_modification_object, task160_replace_letter_in_a_sentence, task469_mrqa_answer_generation, task105_story_cloze-rocstories_sentence_generation, task649_race_blank_question_generation, task1536_daily_dialog_happiness_classification, task683_online_privacy_policy_text_purpose_answer_generation, task024_cosmosqa_answer_generation, task584_udeps_eng_fine_pos_tagging, task066_timetravel_binary_consistency_classification, task413_mickey_en_sentence_perturbation_generation, task182_duorc_question_generation, task028_drop_answer_generation, task1601_webquestions_answer_generation, task1295_adversarial_qa_question_answering, task201_mnli_neutral_classification, task038_qasc_combined_fact, task293_storycommonsense_emotion_text_generation, task572_recipe_nlg_text_generation, task517_emo_classify_emotion_of_dialogue, task382_hybridqa_answer_generation, task176_break_decompose_questions, task1291_multi_news_summarization, task155_count_nouns_verbs, task031_winogrande_question_generation_object, task279_stereoset_classification_stereotype, task1336_peixian_equity_evaluation_corpus_gender_classifier, task508_scruples_dilemmas_more_ethical_isidentifiable, task518_emo_different_dialogue_emotions, task077_splash_explanation_to_sql, task923_event2mind_classifier, task470_mrqa_question_generation, task638_multi_woz_classification, task1412_web_questions_question_answering, task847_pubmedqa_question_generation, task678_ollie_actual_relationship_answer_generation, task290_tellmewhy_question_answerability, task575_air_dialogue_classification, task189_snli_neutral_to_contradiction_text_modification, task026_drop_question_generation, task162_count_words_starting_with_letter, task079_conala_concat_strings, task610_conllpp_ner, task046_miscellaneous_question_typing, task197_mnli_domain_answer_generation, task1325_qa_zre_question_generation_on_subject_relation, task430_senteval_subject_count, task672_nummersense, task402_grailqa_paraphrase_generation, task904_hate_speech_offensive_classification, task192_hotpotqa_sentence_generation, task069_abductivenli_classification, task574_air_dialogue_sentence_generation, task187_snli_entailment_to_contradiction_text_modification, task749_glucose_reverse_cause_emotion_detection, task1552_scitail_question_generation, task750_aqua_multiple_choice_answering, task327_jigsaw_classification_toxic, task1502_hatexplain_classification, task328_jigsaw_classification_insult, task304_numeric_fused_head_resolution, task1293_kilt_tasks_hotpotqa_question_answering, task216_rocstories_correct_answer_generation, task1326_qa_zre_question_generation_from_answer, task1338_peixian_equity_evaluation_corpus_sentiment_classifier, task1729_personachat_generate_next, task1202_atomic_classification_xneed, task400_paws_paraphrase_classification, task502_scruples_anecdotes_whoiswrong_verification, task088_identify_typo_verification, task221_rocstories_two_choice_classification, task200_mnli_entailment_classification, task074_squad1.1_question_generation, task581_socialiqa_question_generation, task1186_nne_hrngo_classification, task898_freebase_qa_answer_generation, task1408_dart_similarity_classification, task168_strategyqa_question_decomposition, task1357_xlsum_summary_generation, task390_torque_text_span_selection, task165_mcscript_question_answering_commonsense, task1533_daily_dialog_formal_classification, task002_quoref_answer_generation, task1297_qasc_question_answering, task305_jeopardy_answer_generation_normal, task029_winogrande_full_object, task1327_qa_zre_answer_generation_from_question, task326_jigsaw_classification_obscene, task1542_every_ith_element_from_starting, task570_recipe_nlg_ner_generation, task1409_dart_text_generation, task401_numeric_fused_head_reference, task846_pubmedqa_classification, task1712_poki_classification, task344_hybridqa_answer_generation, task875_emotion_classification, task1214_atomic_classification_xwant, task106_scruples_ethical_judgment, task238_iirc_answer_from_passage_answer_generation, task1391_winogrande_easy_answer_generation, task195_sentiment140_classification, task163_count_words_ending_with_letter, task579_socialiqa_classification, task569_recipe_nlg_text_generation, task1602_webquestion_question_genreation, task747_glucose_cause_emotion_detection, task219_rocstories_title_answer_generation, task178_quartz_question_answering, task103_facts2story_long_text_generation, task301_record_question_generation, task1369_healthfact_sentence_generation, task515_senteval_odd_word_out, task496_semeval_answer_generation, task1658_billsum_summarization, task1204_atomic_classification_hinderedby, task1392_superglue_multirc_answer_verification, task306_jeopardy_answer_generation_double, task1286_openbookqa_question_answering, task159_check_frequency_of_words_in_sentence_pair, task151_tomqa_find_location_easy_clean, task323_jigsaw_classification_sexually_explicit, task037_qasc_generate_related_fact, task027_drop_answer_type_generation, task1596_event2mind_text_generation_2, task141_odd-man-out_classification_category, task194_duorc_answer_generation, task679_hope_edi_english_text_classification, task246_dream_question_generation, task1195_disflqa_disfluent_to_fluent_conversion, task065_timetravel_consistent_sentence_classification, task351_winomt_classification_gender_identifiability_anti, task580_socialiqa_answer_generation, task583_udeps_eng_coarse_pos_tagging, task202_mnli_contradiction_classification, task222_rocstories_two_chioce_slotting_classification, task498_scruples_anecdotes_whoiswrong_classification, task067_abductivenli_answer_generation, task616_cola_classification, task286_olid_offense_judgment, task188_snli_neutral_to_entailment_text_modification, task223_quartz_explanation_generation, task820_protoqa_answer_generation, task196_sentiment140_answer_generation, task1678_mathqa_answer_selection, task349_squad2.0_answerable_unanswerable_question_classification, task154_tomqa_find_location_hard_noise, task333_hateeval_classification_hate_en, task235_iirc_question_from_subtext_answer_generation, task1554_scitail_classification, task210_logic2text_structured_text_generation, task035_winogrande_question_modification_person, task230_iirc_passage_classification, task1356_xlsum_title_generation, task1726_mathqa_correct_answer_generation, task302_record_classification, task380_boolq_yes_no_question, task212_logic2text_classification, task748_glucose_reverse_cause_event_detection, task834_mathdataset_classification, task350_winomt_classification_gender_identifiability_pro, task191_hotpotqa_question_generation, task236_iirc_question_from_passage_answer_generation, task217_rocstories_ordering_answer_generation, task568_circa_question_generation, task614_glucose_cause_event_detection, task361_spolin_yesand_prompt_response_classification, task421_persent_sentence_sentiment_classification, task203_mnli_sentence_generation, task420_persent_document_sentiment_classification, task153_tomqa_find_location_hard_clean, task346_hybridqa_classification, task1211_atomic_classification_hassubevent, task360_spolin_yesand_response_generation, task510_reddit_tifu_title_summarization, task511_reddit_tifu_long_text_summarization, task345_hybridqa_answer_generation, task270_csrg_counterfactual_context_generation, task307_jeopardy_answer_generation_final, task001_quoref_question_generation, task089_swap_words_verification, task1196_atomic_classification_oeffect, task080_piqa_answer_generation, task1598_nyc_long_text_generation, task240_tweetqa_question_generation, task615_moviesqa_answer_generation, task1347_glue_sts-b_similarity_classification, task114_is_the_given_word_longest, task292_storycommonsense_character_text_generation, task115_help_advice_classification, task431_senteval_object_count, task1360_numer_sense_multiple_choice_qa_generation, task177_para-nmt_paraphrasing, task132_dais_text_modification, task269_csrg_counterfactual_story_generation, task233_iirc_link_exists_classification, task161_count_words_containing_letter, task1205_atomic_classification_isafter, task571_recipe_nlg_ner_generation, task1292_yelp_review_full_text_categorization, task428_senteval_inversion, task311_race_question_generation, task429_senteval_tense, task403_creak_commonsense_inference, task929_products_reviews_classification, task582_naturalquestion_answer_generation, task237_iirc_answer_from_subtext_answer_generation, task050_multirc_answerability, task184_break_generate_question, task669_ambigqa_answer_generation, task169_strategyqa_sentence_generation, task500_scruples_anecdotes_title_generation, task241_tweetqa_classification, task1345_glue_qqp_question_paraprashing, task218_rocstories_swap_order_answer_generation, task613_politifact_text_generation, task1167_penn_treebank_coarse_pos_tagging, task1422_mathqa_physics, task247_dream_answer_generation, task199_mnli_classification, task164_mcscript_question_answering_text, task1541_agnews_classification, task516_senteval_conjoints_inversion, task294_storycommonsense_motiv_text_generation, task501_scruples_anecdotes_post_type_verification, task213_rocstories_correct_ending_classification, task821_protoqa_question_generation, task493_review_polarity_classification, task308_jeopardy_answer_generation_all, task1595_event2mind_text_generation_1, task040_qasc_question_generation, task231_iirc_link_classification, task1727_wiqa_what_is_the_effect, task578_curiosity_dialogs_answer_generation, task310_race_classification, task309_race_answer_generation, task379_agnews_topic_classification, task030_winogrande_full_person, task1540_parsed_pdfs_summarization, task039_qasc_find_overlapping_words, task1206_atomic_classification_isbefore, task157_count_vowels_and_consonants, task339_record_answer_generation, task453_swag_answer_generation, task848_pubmedqa_classification, task673_google_wellformed_query_classification, task676_ollie_relationship_answer_generation, task268_casehold_legal_answer_generation, task844_financial_phrasebank_classification, task330_gap_answer_generation, task595_mocha_answer_generation, task1285_kpa_keypoint_matching, task234_iirc_passage_line_answer_generation, task494_review_polarity_answer_generation, task670_ambigqa_question_generation, task289_gigaword_summarization, npr, nli, SimpleWiki, amazon_review_2018, ccnews_title_text, agnews, xsum, msmarco, yahoo_answers_title_answer, squad_pairs, wow, mteb-amazon_counterfactual-avs_triplets, mteb-amazon_massive_intent-avs_triplets, mteb-amazon_massive_scenario-avs_triplets, mteb-amazon_reviews_multi-avs_triplets, mteb-banking77-avs_triplets, mteb-emotion-avs_triplets, mteb-imdb-avs_triplets, mteb-mtop_domain-avs_triplets, mteb-mtop_intent-avs_triplets, mteb-toxic_conversations_50k-avs_triplets, mteb-tweet_sentiment_extraction-avs_triplets and covid-bing-query-gpt4-avs_triplets datasets. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: sentence-transformers/all-MiniLM-L6-v2
- Maximum Sequence Length: 256 tokens
- Output Dimensionality: 384 tokens
- Similarity Function: Cosine Similarity
- Training Datasets:
- NQ
- pubmed
- specter_train_triples
- S2ORC_citations_abstracts
- fever
- gooaq_pairs
- codesearchnet
- wikihow
- WikiAnswers
- eli5_question_answer
- amazon-qa
- medmcqa
- zeroshot
- TriviaQA_pairs
- PAQ_pairs
- stackexchange_duplicate_questions_title-body_title-body
- trex
- flickr30k_captions
- hotpotqa
- task671_ambigqa_text_generation
- task061_ropes_answer_generation
- task285_imdb_answer_generation
- task905_hate_speech_offensive_classification
- task566_circa_classification
- task184_snli_entailment_to_neutral_text_modification
- task280_stereoset_classification_stereotype_type
- task1599_smcalflow_classification
- task1384_deal_or_no_dialog_classification
- task591_sciq_answer_generation
- task823_peixian-rtgender_sentiment_analysis
- task023_cosmosqa_question_generation
- task900_freebase_qa_category_classification
- task924_event2mind_word_generation
- task152_tomqa_find_location_easy_noise
- task1368_healthfact_sentence_generation
- task1661_super_glue_classification
- task1187_politifact_classification
- task1728_web_nlg_data_to_text
- task112_asset_simple_sentence_identification
- task1340_msr_text_compression_compression
- task072_abductivenli_answer_generation
- task1504_hatexplain_answer_generation
- task684_online_privacy_policy_text_information_type_generation
- task1290_xsum_summarization
- task075_squad1.1_answer_generation
- task1587_scifact_classification
- task384_socialiqa_question_classification
- task1555_scitail_answer_generation
- task1532_daily_dialog_emotion_classification
- task239_tweetqa_answer_generation
- task596_mocha_question_generation
- task1411_dart_subject_identification
- task1359_numer_sense_answer_generation
- task329_gap_classification
- task220_rocstories_title_classification
- task316_crows-pairs_classification_stereotype
- task495_semeval_headline_classification
- task1168_brown_coarse_pos_tagging
- task348_squad2.0_unanswerable_question_generation
- task049_multirc_questions_needed_to_answer
- task1534_daily_dialog_question_classification
- task322_jigsaw_classification_threat
- task295_semeval_2020_task4_commonsense_reasoning
- task186_snli_contradiction_to_entailment_text_modification
- task034_winogrande_question_modification_object
- task160_replace_letter_in_a_sentence
- task469_mrqa_answer_generation
- task105_story_cloze-rocstories_sentence_generation
- task649_race_blank_question_generation
- task1536_daily_dialog_happiness_classification
- task683_online_privacy_policy_text_purpose_answer_generation
- task024_cosmosqa_answer_generation
- task584_udeps_eng_fine_pos_tagging
- task066_timetravel_binary_consistency_classification
- task413_mickey_en_sentence_perturbation_generation
- task182_duorc_question_generation
- task028_drop_answer_generation
- task1601_webquestions_answer_generation
- task1295_adversarial_qa_question_answering
- task201_mnli_neutral_classification
- task038_qasc_combined_fact
- task293_storycommonsense_emotion_text_generation
- task572_recipe_nlg_text_generation
- task517_emo_classify_emotion_of_dialogue
- task382_hybridqa_answer_generation
- task176_break_decompose_questions
- task1291_multi_news_summarization
- task155_count_nouns_verbs
- task031_winogrande_question_generation_object
- task279_stereoset_classification_stereotype
- task1336_peixian_equity_evaluation_corpus_gender_classifier
- task508_scruples_dilemmas_more_ethical_isidentifiable
- task518_emo_different_dialogue_emotions
- task077_splash_explanation_to_sql
- task923_event2mind_classifier
- task470_mrqa_question_generation
- task638_multi_woz_classification
- task1412_web_questions_question_answering
- task847_pubmedqa_question_generation
- task678_ollie_actual_relationship_answer_generation
- task290_tellmewhy_question_answerability
- task575_air_dialogue_classification
- task189_snli_neutral_to_contradiction_text_modification
- task026_drop_question_generation
- task162_count_words_starting_with_letter
- task079_conala_concat_strings
- task610_conllpp_ner
- task046_miscellaneous_question_typing
- task197_mnli_domain_answer_generation
- task1325_qa_zre_question_generation_on_subject_relation
- task430_senteval_subject_count
- task672_nummersense
- task402_grailqa_paraphrase_generation
- task904_hate_speech_offensive_classification
- task192_hotpotqa_sentence_generation
- task069_abductivenli_classification
- task574_air_dialogue_sentence_generation
- task187_snli_entailment_to_contradiction_text_modification
- task749_glucose_reverse_cause_emotion_detection
- task1552_scitail_question_generation
- task750_aqua_multiple_choice_answering
- task327_jigsaw_classification_toxic
- task1502_hatexplain_classification
- task328_jigsaw_classification_insult
- task304_numeric_fused_head_resolution
- task1293_kilt_tasks_hotpotqa_question_answering
- task216_rocstories_correct_answer_generation
- task1326_qa_zre_question_generation_from_answer
- task1338_peixian_equity_evaluation_corpus_sentiment_classifier
- task1729_personachat_generate_next
- task1202_atomic_classification_xneed
- task400_paws_paraphrase_classification
- task502_scruples_anecdotes_whoiswrong_verification
- task088_identify_typo_verification
- task221_rocstories_two_choice_classification
- task200_mnli_entailment_classification
- task074_squad1.1_question_generation
- task581_socialiqa_question_generation
- task1186_nne_hrngo_classification
- task898_freebase_qa_answer_generation
- task1408_dart_similarity_classification
- task168_strategyqa_question_decomposition
- task1357_xlsum_summary_generation
- task390_torque_text_span_selection
- task165_mcscript_question_answering_commonsense
- task1533_daily_dialog_formal_classification
- task002_quoref_answer_generation
- task1297_qasc_question_answering
- task305_jeopardy_answer_generation_normal
- task029_winogrande_full_object
- task1327_qa_zre_answer_generation_from_question
- task326_jigsaw_classification_obscene
- task1542_every_ith_element_from_starting
- task570_recipe_nlg_ner_generation
- task1409_dart_text_generation
- task401_numeric_fused_head_reference
- task846_pubmedqa_classification
- task1712_poki_classification
- task344_hybridqa_answer_generation
- task875_emotion_classification
- task1214_atomic_classification_xwant
- task106_scruples_ethical_judgment
- task238_iirc_answer_from_passage_answer_generation
- task1391_winogrande_easy_answer_generation
- task195_sentiment140_classification
- task163_count_words_ending_with_letter
- task579_socialiqa_classification
- task569_recipe_nlg_text_generation
- task1602_webquestion_question_genreation
- task747_glucose_cause_emotion_detection
- task219_rocstories_title_answer_generation
- task178_quartz_question_answering
- task103_facts2story_long_text_generation
- task301_record_question_generation
- task1369_healthfact_sentence_generation
- task515_senteval_odd_word_out
- task496_semeval_answer_generation
- task1658_billsum_summarization
- task1204_atomic_classification_hinderedby
- task1392_superglue_multirc_answer_verification
- task306_jeopardy_answer_generation_double
- task1286_openbookqa_question_answering
- task159_check_frequency_of_words_in_sentence_pair
- task151_tomqa_find_location_easy_clean
- task323_jigsaw_classification_sexually_explicit
- task037_qasc_generate_related_fact
- task027_drop_answer_type_generation
- task1596_event2mind_text_generation_2
- task141_odd-man-out_classification_category
- task194_duorc_answer_generation
- task679_hope_edi_english_text_classification
- task246_dream_question_generation
- task1195_disflqa_disfluent_to_fluent_conversion
- task065_timetravel_consistent_sentence_classification
- task351_winomt_classification_gender_identifiability_anti
- task580_socialiqa_answer_generation
- task583_udeps_eng_coarse_pos_tagging
- task202_mnli_contradiction_classification
- task222_rocstories_two_chioce_slotting_classification
- task498_scruples_anecdotes_whoiswrong_classification
- task067_abductivenli_answer_generation
- task616_cola_classification
- task286_olid_offense_judgment
- task188_snli_neutral_to_entailment_text_modification
- task223_quartz_explanation_generation
- task820_protoqa_answer_generation
- task196_sentiment140_answer_generation
- task1678_mathqa_answer_selection
- task349_squad2.0_answerable_unanswerable_question_classification
- task154_tomqa_find_location_hard_noise
- task333_hateeval_classification_hate_en
- task235_iirc_question_from_subtext_answer_generation
- task1554_scitail_classification
- task210_logic2text_structured_text_generation
- task035_winogrande_question_modification_person
- task230_iirc_passage_classification
- task1356_xlsum_title_generation
- task1726_mathqa_correct_answer_generation
- task302_record_classification
- task380_boolq_yes_no_question
- task212_logic2text_classification
- task748_glucose_reverse_cause_event_detection
- task834_mathdataset_classification
- task350_winomt_classification_gender_identifiability_pro
- task191_hotpotqa_question_generation
- task236_iirc_question_from_passage_answer_generation
- task217_rocstories_ordering_answer_generation
- task568_circa_question_generation
- task614_glucose_cause_event_detection
- task361_spolin_yesand_prompt_response_classification
- task421_persent_sentence_sentiment_classification
- task203_mnli_sentence_generation
- task420_persent_document_sentiment_classification
- task153_tomqa_find_location_hard_clean
- task346_hybridqa_classification
- task1211_atomic_classification_hassubevent
- task360_spolin_yesand_response_generation
- task510_reddit_tifu_title_summarization
- task511_reddit_tifu_long_text_summarization
- task345_hybridqa_answer_generation
- task270_csrg_counterfactual_context_generation
- task307_jeopardy_answer_generation_final
- task001_quoref_question_generation
- task089_swap_words_verification
- task1196_atomic_classification_oeffect
- task080_piqa_answer_generation
- task1598_nyc_long_text_generation
- task240_tweetqa_question_generation
- task615_moviesqa_answer_generation
- task1347_glue_sts-b_similarity_classification
- task114_is_the_given_word_longest
- task292_storycommonsense_character_text_generation
- task115_help_advice_classification
- task431_senteval_object_count
- task1360_numer_sense_multiple_choice_qa_generation
- task177_para-nmt_paraphrasing
- task132_dais_text_modification
- task269_csrg_counterfactual_story_generation
- task233_iirc_link_exists_classification
- task161_count_words_containing_letter
- task1205_atomic_classification_isafter
- task571_recipe_nlg_ner_generation
- task1292_yelp_review_full_text_categorization
- task428_senteval_inversion
- task311_race_question_generation
- task429_senteval_tense
- task403_creak_commonsense_inference
- task929_products_reviews_classification
- task582_naturalquestion_answer_generation
- task237_iirc_answer_from_subtext_answer_generation
- task050_multirc_answerability
- task184_break_generate_question
- task669_ambigqa_answer_generation
- task169_strategyqa_sentence_generation
- task500_scruples_anecdotes_title_generation
- task241_tweetqa_classification
- task1345_glue_qqp_question_paraprashing
- task218_rocstories_swap_order_answer_generation
- task613_politifact_text_generation
- task1167_penn_treebank_coarse_pos_tagging
- task1422_mathqa_physics
- task247_dream_answer_generation
- task199_mnli_classification
- task164_mcscript_question_answering_text
- task1541_agnews_classification
- task516_senteval_conjoints_inversion
- task294_storycommonsense_motiv_text_generation
- task501_scruples_anecdotes_post_type_verification
- task213_rocstories_correct_ending_classification
- task821_protoqa_question_generation
- task493_review_polarity_classification
- task308_jeopardy_answer_generation_all
- task1595_event2mind_text_generation_1
- task040_qasc_question_generation
- task231_iirc_link_classification
- task1727_wiqa_what_is_the_effect
- task578_curiosity_dialogs_answer_generation
- task310_race_classification
- task309_race_answer_generation
- task379_agnews_topic_classification
- task030_winogrande_full_person
- task1540_parsed_pdfs_summarization
- task039_qasc_find_overlapping_words
- task1206_atomic_classification_isbefore
- task157_count_vowels_and_consonants
- task339_record_answer_generation
- task453_swag_answer_generation
- task848_pubmedqa_classification
- task673_google_wellformed_query_classification
- task676_ollie_relationship_answer_generation
- task268_casehold_legal_answer_generation
- task844_financial_phrasebank_classification
- task330_gap_answer_generation
- task595_mocha_answer_generation
- task1285_kpa_keypoint_matching
- task234_iirc_passage_line_answer_generation
- task494_review_polarity_answer_generation
- task670_ambigqa_question_generation
- task289_gigaword_summarization
- npr
- nli
- SimpleWiki
- amazon_review_2018
- ccnews_title_text
- agnews
- xsum
- msmarco
- yahoo_answers_title_answer
- squad_pairs
- wow
- mteb-amazon_counterfactual-avs_triplets
- mteb-amazon_massive_intent-avs_triplets
- mteb-amazon_massive_scenario-avs_triplets
- mteb-amazon_reviews_multi-avs_triplets
- mteb-banking77-avs_triplets
- mteb-emotion-avs_triplets
- mteb-imdb-avs_triplets
- mteb-mtop_domain-avs_triplets
- mteb-mtop_intent-avs_triplets
- mteb-toxic_conversations_50k-avs_triplets
- mteb-tweet_sentiment_extraction-avs_triplets
- covid-bing-query-gpt4-avs_triplets
- Language: en
- License: apache-2.0
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("avsolatorio/all-MiniLM-L6-v2-MEDI-MTEB-triplet-final")
# Run inference
sentences = [
'who does george nelson represent in o brother where art thou',
'O Brother, Where Art Thou? omitted all instances of the words "damn" and "hell" from the Coens\' script, which only became known to Clooney after the directors pointed this out to him during shooting. This was the fourth film of the brothers in which John Turturro has starred. Other actors in "O Brother, Where Art Thou?" who had worked previously with the Coens include John Goodman (three films), Holly Hunter (two), Michael Badalucco and Charles Durning (one film each). The Coens used digital color correction to give the film a sepia-tinted look. Joel stated this was because the actual set was "greener than Ireland". Cinematographer',
'O Brother, Where Art Thou? the film got together and performed the music from the film in a Down from the Mountain concert tour which was filmed for TV and DVD. This included Ralph Stanley, John Hartford, Alison Krauss, Emmylou Harris, Gillian Welch, Chris Sharp, and others. O Brother, Where Art Thou? O Brother, Where Art Thou? is a 2000 crime comedy film written, produced, and directed by Joel and Ethan Coen, and starring George Clooney, John Turturro, and Tim Blake Nelson, with John Goodman, Holly Hunter, and Charles Durning in supporting roles. The film is set in 1937 rural Mississippi during the Great Depression.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Evaluation
Metrics
Triplet
- Dataset:
medi-mteb-dev
- Evaluated with
TripletEvaluator
Metric | Value |
---|---|
cosine_accuracy | 0.9117 |
dot_accuracy | 0.081 |
manhattan_accuracy | 0.912 |
euclidean_accuracy | 0.9115 |
max_accuracy | 0.912 |
Training Details
Training Datasets
NQ
- Dataset: NQ
- Size: 49,676 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 11.91 tokens
- max: 24 tokens
- min: 111 tokens
- mean: 137.95 tokens
- max: 212 tokens
- min: 113 tokens
- mean: 138.79 tokens
- max: 209 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
pubmed
- Dataset: pubmed
- Size: 29,908 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 22.81 tokens
- max: 62 tokens
- min: 93 tokens
- mean: 240.49 tokens
- max: 256 tokens
- min: 73 tokens
- mean: 239.5 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
specter_train_triples
- Dataset: specter_train_triples
- Size: 49,676 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 15.69 tokens
- max: 94 tokens
- min: 4 tokens
- mean: 14.12 tokens
- max: 39 tokens
- min: 4 tokens
- mean: 16.39 tokens
- max: 64 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
S2ORC_citations_abstracts
- Dataset: S2ORC_citations_abstracts
- Size: 99,352 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 20 tokens
- mean: 196.74 tokens
- max: 256 tokens
- min: 24 tokens
- mean: 203.91 tokens
- max: 256 tokens
- min: 24 tokens
- mean: 208.09 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
fever
- Dataset: fever
- Size: 74,514 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 12.49 tokens
- max: 51 tokens
- min: 48 tokens
- mean: 112.67 tokens
- max: 154 tokens
- min: 35 tokens
- mean: 113.92 tokens
- max: 163 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
gooaq_pairs
- Dataset: gooaq_pairs
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 11.92 tokens
- max: 24 tokens
- min: 14 tokens
- mean: 60.11 tokens
- max: 150 tokens
- min: 15 tokens
- mean: 63.73 tokens
- max: 150 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
codesearchnet
- Dataset: codesearchnet
- Size: 15,210 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 28.96 tokens
- max: 143 tokens
- min: 28 tokens
- mean: 134.91 tokens
- max: 256 tokens
- min: 29 tokens
- mean: 163.95 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
wikihow
- Dataset: wikihow
- Size: 5,070 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 8.05 tokens
- max: 21 tokens
- min: 13 tokens
- mean: 45.27 tokens
- max: 117 tokens
- min: 10 tokens
- mean: 35.68 tokens
- max: 75 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
WikiAnswers
- Dataset: WikiAnswers
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 12.79 tokens
- max: 43 tokens
- min: 6 tokens
- mean: 12.93 tokens
- max: 47 tokens
- min: 6 tokens
- mean: 13.13 tokens
- max: 44 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
eli5_question_answer
- Dataset: eli5_question_answer
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 21.16 tokens
- max: 69 tokens
- min: 11 tokens
- mean: 100.92 tokens
- max: 256 tokens
- min: 13 tokens
- mean: 112.62 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
amazon-qa
- Dataset: amazon-qa
- Size: 99,352 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 23.56 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 52.4 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 62.09 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
medmcqa
- Dataset: medmcqa
- Size: 29,908 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 19.62 tokens
- max: 167 tokens
- min: 3 tokens
- mean: 110.24 tokens
- max: 256 tokens
- min: 3 tokens
- mean: 111.99 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
zeroshot
- Dataset: zeroshot
- Size: 15,210 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 8.7 tokens
- max: 20 tokens
- min: 10 tokens
- mean: 112.73 tokens
- max: 178 tokens
- min: 14 tokens
- mean: 115.71 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
TriviaQA_pairs
- Dataset: TriviaQA_pairs
- Size: 49,676 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 19.22 tokens
- max: 59 tokens
- min: 33 tokens
- mean: 246.01 tokens
- max: 256 tokens
- min: 21 tokens
- mean: 232.19 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
PAQ_pairs
- Dataset: PAQ_pairs
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 12.6 tokens
- max: 22 tokens
- min: 112 tokens
- mean: 136.78 tokens
- max: 205 tokens
- min: 110 tokens
- mean: 135.66 tokens
- max: 254 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
stackexchange_duplicate_questions_title-body_title-body
- Dataset: stackexchange_duplicate_questions_title-body_title-body
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 150.59 tokens
- max: 256 tokens
- min: 20 tokens
- mean: 142.04 tokens
- max: 256 tokens
- min: 27 tokens
- mean: 198.29 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
trex
- Dataset: trex
- Size: 29,908 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 9.55 tokens
- max: 27 tokens
- min: 16 tokens
- mean: 104.71 tokens
- max: 212 tokens
- min: 14 tokens
- mean: 118.22 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
flickr30k_captions
- Dataset: flickr30k_captions
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 15.95 tokens
- max: 88 tokens
- min: 7 tokens
- mean: 15.68 tokens
- max: 59 tokens
- min: 7 tokens
- mean: 17.15 tokens
- max: 52 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
hotpotqa
- Dataset: hotpotqa
- Size: 40,048 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 23.83 tokens
- max: 103 tokens
- min: 27 tokens
- mean: 113.6 tokens
- max: 194 tokens
- min: 38 tokens
- mean: 115.33 tokens
- max: 178 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task671_ambigqa_text_generation
- Dataset: task671_ambigqa_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 11 tokens
- mean: 12.69 tokens
- max: 26 tokens
- min: 11 tokens
- mean: 12.52 tokens
- max: 23 tokens
- min: 11 tokens
- mean: 12.23 tokens
- max: 19 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task061_ropes_answer_generation
- Dataset: task061_ropes_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 117 tokens
- mean: 208.96 tokens
- max: 256 tokens
- min: 117 tokens
- mean: 208.27 tokens
- max: 256 tokens
- min: 119 tokens
- mean: 210.46 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task285_imdb_answer_generation
- Dataset: task285_imdb_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 46 tokens
- mean: 208.78 tokens
- max: 256 tokens
- min: 49 tokens
- mean: 203.97 tokens
- max: 256 tokens
- min: 46 tokens
- mean: 208.78 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task905_hate_speech_offensive_classification
- Dataset: task905_hate_speech_offensive_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 15 tokens
- mean: 41.73 tokens
- max: 164 tokens
- min: 13 tokens
- mean: 40.48 tokens
- max: 198 tokens
- min: 13 tokens
- mean: 32.23 tokens
- max: 135 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task566_circa_classification
- Dataset: task566_circa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 20 tokens
- mean: 27.77 tokens
- max: 48 tokens
- min: 19 tokens
- mean: 27.22 tokens
- max: 44 tokens
- min: 20 tokens
- mean: 27.46 tokens
- max: 47 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task184_snli_entailment_to_neutral_text_modification
- Dataset: task184_snli_entailment_to_neutral_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 29.98 tokens
- max: 72 tokens
- min: 16 tokens
- mean: 28.9 tokens
- max: 60 tokens
- min: 17 tokens
- mean: 30.33 tokens
- max: 100 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task280_stereoset_classification_stereotype_type
- Dataset: task280_stereoset_classification_stereotype_type
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 18.47 tokens
- max: 53 tokens
- min: 8 tokens
- mean: 16.89 tokens
- max: 53 tokens
- min: 8 tokens
- mean: 16.86 tokens
- max: 51 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1599_smcalflow_classification
- Dataset: task1599_smcalflow_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 11.25 tokens
- max: 37 tokens
- min: 3 tokens
- mean: 10.47 tokens
- max: 38 tokens
- min: 5 tokens
- mean: 16.12 tokens
- max: 45 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1384_deal_or_no_dialog_classification
- Dataset: task1384_deal_or_no_dialog_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 59.1 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 59.35 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 58.47 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task591_sciq_answer_generation
- Dataset: task591_sciq_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 17.61 tokens
- max: 70 tokens
- min: 7 tokens
- mean: 17.17 tokens
- max: 43 tokens
- min: 6 tokens
- mean: 16.67 tokens
- max: 75 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task823_peixian-rtgender_sentiment_analysis
- Dataset: task823_peixian-rtgender_sentiment_analysis
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 57.26 tokens
- max: 179 tokens
- min: 16 tokens
- mean: 60.03 tokens
- max: 153 tokens
- min: 14 tokens
- mean: 60.89 tokens
- max: 169 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task023_cosmosqa_question_generation
- Dataset: task023_cosmosqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 35 tokens
- mean: 79.52 tokens
- max: 159 tokens
- min: 34 tokens
- mean: 80.36 tokens
- max: 165 tokens
- min: 35 tokens
- mean: 79.14 tokens
- max: 161 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task900_freebase_qa_category_classification
- Dataset: task900_freebase_qa_category_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 20.44 tokens
- max: 88 tokens
- min: 8 tokens
- mean: 18.33 tokens
- max: 62 tokens
- min: 8 tokens
- mean: 19.14 tokens
- max: 69 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task924_event2mind_word_generation
- Dataset: task924_event2mind_word_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 32.06 tokens
- max: 64 tokens
- min: 17 tokens
- mean: 32.13 tokens
- max: 70 tokens
- min: 17 tokens
- mean: 31.58 tokens
- max: 68 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task152_tomqa_find_location_easy_noise
- Dataset: task152_tomqa_find_location_easy_noise
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 37 tokens
- mean: 52.96 tokens
- max: 79 tokens
- min: 37 tokens
- mean: 52.53 tokens
- max: 78 tokens
- min: 37 tokens
- mean: 52.92 tokens
- max: 82 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1368_healthfact_sentence_generation
- Dataset: task1368_healthfact_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 91 tokens
- mean: 240.57 tokens
- max: 256 tokens
- min: 84 tokens
- mean: 239.31 tokens
- max: 256 tokens
- min: 97 tokens
- mean: 245.05 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1661_super_glue_classification
- Dataset: task1661_super_glue_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 35 tokens
- mean: 140.99 tokens
- max: 256 tokens
- min: 31 tokens
- mean: 142.44 tokens
- max: 256 tokens
- min: 31 tokens
- mean: 143.37 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1187_politifact_classification
- Dataset: task1187_politifact_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 33.28 tokens
- max: 79 tokens
- min: 10 tokens
- mean: 31.59 tokens
- max: 75 tokens
- min: 13 tokens
- mean: 31.9 tokens
- max: 71 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1728_web_nlg_data_to_text
- Dataset: task1728_web_nlg_data_to_text
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 43.07 tokens
- max: 152 tokens
- min: 7 tokens
- mean: 46.55 tokens
- max: 152 tokens
- min: 8 tokens
- mean: 43.18 tokens
- max: 152 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task112_asset_simple_sentence_identification
- Dataset: task112_asset_simple_sentence_identification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 51.87 tokens
- max: 136 tokens
- min: 18 tokens
- mean: 51.68 tokens
- max: 144 tokens
- min: 22 tokens
- mean: 51.93 tokens
- max: 114 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1340_msr_text_compression_compression
- Dataset: task1340_msr_text_compression_compression
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 41.77 tokens
- max: 116 tokens
- min: 14 tokens
- mean: 44.27 tokens
- max: 133 tokens
- min: 12 tokens
- mean: 40.08 tokens
- max: 141 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task072_abductivenli_answer_generation
- Dataset: task072_abductivenli_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 26.8 tokens
- max: 56 tokens
- min: 16 tokens
- mean: 26.15 tokens
- max: 47 tokens
- min: 16 tokens
- mean: 26.4 tokens
- max: 55 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1504_hatexplain_answer_generation
- Dataset: task1504_hatexplain_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 28.53 tokens
- max: 72 tokens
- min: 5 tokens
- mean: 24.21 tokens
- max: 86 tokens
- min: 5 tokens
- mean: 27.94 tokens
- max: 67 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task684_online_privacy_policy_text_information_type_generation
- Dataset: task684_online_privacy_policy_text_information_type_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 29.91 tokens
- max: 68 tokens
- min: 10 tokens
- mean: 30.18 tokens
- max: 61 tokens
- min: 14 tokens
- mean: 30.06 tokens
- max: 68 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1290_xsum_summarization
- Dataset: task1290_xsum_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 39 tokens
- mean: 226.28 tokens
- max: 256 tokens
- min: 50 tokens
- mean: 229.51 tokens
- max: 256 tokens
- min: 34 tokens
- mean: 229.59 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task075_squad1.1_answer_generation
- Dataset: task075_squad1.1_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 48 tokens
- mean: 167.12 tokens
- max: 256 tokens
- min: 45 tokens
- mean: 173.01 tokens
- max: 256 tokens
- min: 46 tokens
- mean: 178.89 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1587_scifact_classification
- Dataset: task1587_scifact_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 88 tokens
- mean: 242.08 tokens
- max: 256 tokens
- min: 90 tokens
- mean: 246.93 tokens
- max: 256 tokens
- min: 86 tokens
- mean: 244.36 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task384_socialiqa_question_classification
- Dataset: task384_socialiqa_question_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 35.46 tokens
- max: 78 tokens
- min: 22 tokens
- mean: 34.33 tokens
- max: 59 tokens
- min: 22 tokens
- mean: 34.52 tokens
- max: 57 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1555_scitail_answer_generation
- Dataset: task1555_scitail_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 36.88 tokens
- max: 90 tokens
- min: 18 tokens
- mean: 36.12 tokens
- max: 80 tokens
- min: 18 tokens
- mean: 36.59 tokens
- max: 92 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1532_daily_dialog_emotion_classification
- Dataset: task1532_daily_dialog_emotion_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 135.8 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 140.06 tokens
- max: 256 tokens
- min: 17 tokens
- mean: 134.53 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task239_tweetqa_answer_generation
- Dataset: task239_tweetqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 28 tokens
- mean: 56.05 tokens
- max: 91 tokens
- min: 29 tokens
- mean: 56.59 tokens
- max: 92 tokens
- min: 25 tokens
- mean: 56.05 tokens
- max: 81 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task596_mocha_question_generation
- Dataset: task596_mocha_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 34 tokens
- mean: 80.75 tokens
- max: 163 tokens
- min: 12 tokens
- mean: 96.06 tokens
- max: 256 tokens
- min: 10 tokens
- mean: 45.02 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1411_dart_subject_identification
- Dataset: task1411_dart_subject_identification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 15.01 tokens
- max: 74 tokens
- min: 6 tokens
- mean: 14.1 tokens
- max: 37 tokens
- min: 6 tokens
- mean: 14.36 tokens
- max: 38 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1359_numer_sense_answer_generation
- Dataset: task1359_numer_sense_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 18.75 tokens
- max: 30 tokens
- min: 10 tokens
- mean: 18.43 tokens
- max: 33 tokens
- min: 10 tokens
- mean: 18.3 tokens
- max: 30 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task329_gap_classification
- Dataset: task329_gap_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 40 tokens
- mean: 123.98 tokens
- max: 256 tokens
- min: 62 tokens
- mean: 127.04 tokens
- max: 256 tokens
- min: 58 tokens
- mean: 128.35 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task220_rocstories_title_classification
- Dataset: task220_rocstories_title_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 53 tokens
- mean: 80.81 tokens
- max: 116 tokens
- min: 51 tokens
- mean: 81.14 tokens
- max: 108 tokens
- min: 55 tokens
- mean: 79.79 tokens
- max: 115 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task316_crows-pairs_classification_stereotype
- Dataset: task316_crows-pairs_classification_stereotype
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 19.78 tokens
- max: 51 tokens
- min: 7 tokens
- mean: 18.35 tokens
- max: 41 tokens
- min: 7 tokens
- mean: 19.82 tokens
- max: 52 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task495_semeval_headline_classification
- Dataset: task495_semeval_headline_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 24.57 tokens
- max: 42 tokens
- min: 15 tokens
- mean: 24.23 tokens
- max: 41 tokens
- min: 15 tokens
- mean: 24.2 tokens
- max: 38 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1168_brown_coarse_pos_tagging
- Dataset: task1168_brown_coarse_pos_tagging
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 43.83 tokens
- max: 142 tokens
- min: 12 tokens
- mean: 43.44 tokens
- max: 197 tokens
- min: 12 tokens
- mean: 44.95 tokens
- max: 197 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task348_squad2.0_unanswerable_question_generation
- Dataset: task348_squad2.0_unanswerable_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 30 tokens
- mean: 153.01 tokens
- max: 256 tokens
- min: 38 tokens
- mean: 161.19 tokens
- max: 256 tokens
- min: 33 tokens
- mean: 167.06 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task049_multirc_questions_needed_to_answer
- Dataset: task049_multirc_questions_needed_to_answer
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 174 tokens
- mean: 252.54 tokens
- max: 256 tokens
- min: 169 tokens
- mean: 252.57 tokens
- max: 256 tokens
- min: 178 tokens
- mean: 252.73 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1534_daily_dialog_question_classification
- Dataset: task1534_daily_dialog_question_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 125.31 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 130.35 tokens
- max: 256 tokens
- min: 16 tokens
- mean: 135.56 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task322_jigsaw_classification_threat
- Dataset: task322_jigsaw_classification_threat
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 54.84 tokens
- max: 256 tokens
- min: 6 tokens
- mean: 62.09 tokens
- max: 249 tokens
- min: 6 tokens
- mean: 62.43 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task295_semeval_2020_task4_commonsense_reasoning
- Dataset: task295_semeval_2020_task4_commonsense_reasoning
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 25 tokens
- mean: 44.81 tokens
- max: 92 tokens
- min: 25 tokens
- mean: 45.07 tokens
- max: 95 tokens
- min: 25 tokens
- mean: 44.7 tokens
- max: 88 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task186_snli_contradiction_to_entailment_text_modification
- Dataset: task186_snli_contradiction_to_entailment_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 31.21 tokens
- max: 102 tokens
- min: 18 tokens
- mean: 30.13 tokens
- max: 65 tokens
- min: 18 tokens
- mean: 32.21 tokens
- max: 67 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task034_winogrande_question_modification_object
- Dataset: task034_winogrande_question_modification_object
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 29 tokens
- mean: 36.36 tokens
- max: 53 tokens
- min: 29 tokens
- mean: 35.59 tokens
- max: 54 tokens
- min: 29 tokens
- mean: 34.87 tokens
- max: 55 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task160_replace_letter_in_a_sentence
- Dataset: task160_replace_letter_in_a_sentence
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 29 tokens
- mean: 31.98 tokens
- max: 49 tokens
- min: 28 tokens
- mean: 31.78 tokens
- max: 41 tokens
- min: 29 tokens
- mean: 31.8 tokens
- max: 48 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task469_mrqa_answer_generation
- Dataset: task469_mrqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 27 tokens
- mean: 182.22 tokens
- max: 256 tokens
- min: 25 tokens
- mean: 180.87 tokens
- max: 256 tokens
- min: 27 tokens
- mean: 184.07 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task105_story_cloze-rocstories_sentence_generation
- Dataset: task105_story_cloze-rocstories_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 36 tokens
- mean: 55.58 tokens
- max: 75 tokens
- min: 35 tokens
- mean: 54.96 tokens
- max: 76 tokens
- min: 36 tokens
- mean: 55.99 tokens
- max: 76 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task649_race_blank_question_generation
- Dataset: task649_race_blank_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 36 tokens
- mean: 253.19 tokens
- max: 256 tokens
- min: 36 tokens
- mean: 252.56 tokens
- max: 256 tokens
- min: 157 tokens
- mean: 254.12 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1536_daily_dialog_happiness_classification
- Dataset: task1536_daily_dialog_happiness_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 127.06 tokens
- max: 256 tokens
- min: 13 tokens
- mean: 133.94 tokens
- max: 256 tokens
- min: 16 tokens
- mean: 142.64 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task683_online_privacy_policy_text_purpose_answer_generation
- Dataset: task683_online_privacy_policy_text_purpose_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 29.93 tokens
- max: 68 tokens
- min: 10 tokens
- mean: 30.22 tokens
- max: 64 tokens
- min: 14 tokens
- mean: 29.85 tokens
- max: 68 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task024_cosmosqa_answer_generation
- Dataset: task024_cosmosqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 45 tokens
- mean: 92.5 tokens
- max: 176 tokens
- min: 47 tokens
- mean: 93.22 tokens
- max: 174 tokens
- min: 42 tokens
- mean: 94.89 tokens
- max: 183 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task584_udeps_eng_fine_pos_tagging
- Dataset: task584_udeps_eng_fine_pos_tagging
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 40.13 tokens
- max: 120 tokens
- min: 12 tokens
- mean: 39.18 tokens
- max: 186 tokens
- min: 12 tokens
- mean: 40.4 tokens
- max: 148 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task066_timetravel_binary_consistency_classification
- Dataset: task066_timetravel_binary_consistency_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 42 tokens
- mean: 66.89 tokens
- max: 93 tokens
- min: 43 tokens
- mean: 67.42 tokens
- max: 94 tokens
- min: 45 tokens
- mean: 67.0 tokens
- max: 92 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task413_mickey_en_sentence_perturbation_generation
- Dataset: task413_mickey_en_sentence_perturbation_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 13.77 tokens
- max: 21 tokens
- min: 7 tokens
- mean: 13.82 tokens
- max: 21 tokens
- min: 7 tokens
- mean: 13.31 tokens
- max: 20 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task182_duorc_question_generation
- Dataset: task182_duorc_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 99 tokens
- mean: 241.8 tokens
- max: 256 tokens
- min: 120 tokens
- mean: 245.95 tokens
- max: 256 tokens
- min: 99 tokens
- mean: 246.6 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task028_drop_answer_generation
- Dataset: task028_drop_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 76 tokens
- mean: 230.72 tokens
- max: 256 tokens
- min: 86 tokens
- mean: 234.59 tokens
- max: 256 tokens
- min: 81 tokens
- mean: 235.71 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1601_webquestions_answer_generation
- Dataset: task1601_webquestions_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 16.47 tokens
- max: 28 tokens
- min: 11 tokens
- mean: 16.67 tokens
- max: 28 tokens
- min: 9 tokens
- mean: 16.76 tokens
- max: 27 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1295_adversarial_qa_question_answering
- Dataset: task1295_adversarial_qa_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 45 tokens
- mean: 165.1 tokens
- max: 256 tokens
- min: 54 tokens
- mean: 167.21 tokens
- max: 256 tokens
- min: 48 tokens
- mean: 166.49 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task201_mnli_neutral_classification
- Dataset: task201_mnli_neutral_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 73.0 tokens
- max: 218 tokens
- min: 25 tokens
- mean: 73.42 tokens
- max: 170 tokens
- min: 27 tokens
- mean: 72.48 tokens
- max: 205 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task038_qasc_combined_fact
- Dataset: task038_qasc_combined_fact
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 31.3 tokens
- max: 57 tokens
- min: 19 tokens
- mean: 30.49 tokens
- max: 53 tokens
- min: 18 tokens
- mean: 30.87 tokens
- max: 53 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task293_storycommonsense_emotion_text_generation
- Dataset: task293_storycommonsense_emotion_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 40.74 tokens
- max: 86 tokens
- min: 15 tokens
- mean: 40.56 tokens
- max: 86 tokens
- min: 14 tokens
- mean: 38.5 tokens
- max: 86 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task572_recipe_nlg_text_generation
- Dataset: task572_recipe_nlg_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 114.82 tokens
- max: 256 tokens
- min: 24 tokens
- mean: 121.93 tokens
- max: 256 tokens
- min: 24 tokens
- mean: 124.38 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task517_emo_classify_emotion_of_dialogue
- Dataset: task517_emo_classify_emotion_of_dialogue
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 18.18 tokens
- max: 78 tokens
- min: 7 tokens
- mean: 17.03 tokens
- max: 59 tokens
- min: 7 tokens
- mean: 18.39 tokens
- max: 67 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task382_hybridqa_answer_generation
- Dataset: task382_hybridqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 29 tokens
- mean: 42.34 tokens
- max: 70 tokens
- min: 29 tokens
- mean: 41.63 tokens
- max: 74 tokens
- min: 28 tokens
- mean: 41.73 tokens
- max: 75 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task176_break_decompose_questions
- Dataset: task176_break_decompose_questions
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 17.39 tokens
- max: 41 tokens
- min: 8 tokens
- mean: 17.19 tokens
- max: 39 tokens
- min: 8 tokens
- mean: 15.71 tokens
- max: 38 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1291_multi_news_summarization
- Dataset: task1291_multi_news_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 116 tokens
- mean: 255.36 tokens
- max: 256 tokens
- min: 146 tokens
- mean: 255.71 tokens
- max: 256 tokens
- min: 68 tokens
- mean: 252.09 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task155_count_nouns_verbs
- Dataset: task155_count_nouns_verbs
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 23 tokens
- mean: 27.03 tokens
- max: 56 tokens
- min: 23 tokens
- mean: 26.8 tokens
- max: 43 tokens
- min: 23 tokens
- mean: 26.94 tokens
- max: 46 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task031_winogrande_question_generation_object
- Dataset: task031_winogrande_question_generation_object
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 7.42 tokens
- max: 11 tokens
- min: 7 tokens
- mean: 7.31 tokens
- max: 11 tokens
- min: 7 tokens
- mean: 7.27 tokens
- max: 11 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task279_stereoset_classification_stereotype
- Dataset: task279_stereoset_classification_stereotype
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 17.91 tokens
- max: 41 tokens
- min: 8 tokens
- mean: 15.43 tokens
- max: 43 tokens
- min: 8 tokens
- mean: 17.2 tokens
- max: 50 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1336_peixian_equity_evaluation_corpus_gender_classifier
- Dataset: task1336_peixian_equity_evaluation_corpus_gender_classifier
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 9.62 tokens
- max: 17 tokens
- min: 6 tokens
- mean: 9.6 tokens
- max: 16 tokens
- min: 6 tokens
- mean: 9.69 tokens
- max: 16 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task508_scruples_dilemmas_more_ethical_isidentifiable
- Dataset: task508_scruples_dilemmas_more_ethical_isidentifiable
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 29.63 tokens
- max: 94 tokens
- min: 12 tokens
- mean: 28.69 tokens
- max: 94 tokens
- min: 12 tokens
- mean: 28.59 tokens
- max: 86 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task518_emo_different_dialogue_emotions
- Dataset: task518_emo_different_dialogue_emotions
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 28 tokens
- mean: 47.83 tokens
- max: 106 tokens
- min: 28 tokens
- mean: 45.51 tokens
- max: 116 tokens
- min: 26 tokens
- mean: 45.81 tokens
- max: 123 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task077_splash_explanation_to_sql
- Dataset: task077_splash_explanation_to_sql
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 39.82 tokens
- max: 126 tokens
- min: 8 tokens
- mean: 39.88 tokens
- max: 126 tokens
- min: 8 tokens
- mean: 35.83 tokens
- max: 111 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task923_event2mind_classifier
- Dataset: task923_event2mind_classifier
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 20.61 tokens
- max: 46 tokens
- min: 11 tokens
- mean: 18.62 tokens
- max: 41 tokens
- min: 11 tokens
- mean: 19.51 tokens
- max: 46 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task470_mrqa_question_generation
- Dataset: task470_mrqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 172.18 tokens
- max: 256 tokens
- min: 11 tokens
- mean: 175.43 tokens
- max: 256 tokens
- min: 14 tokens
- mean: 180.36 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task638_multi_woz_classification
- Dataset: task638_multi_woz_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 78 tokens
- mean: 223.56 tokens
- max: 256 tokens
- min: 76 tokens
- mean: 220.51 tokens
- max: 256 tokens
- min: 64 tokens
- mean: 220.0 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1412_web_questions_question_answering
- Dataset: task1412_web_questions_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 10.33 tokens
- max: 17 tokens
- min: 6 tokens
- mean: 10.18 tokens
- max: 17 tokens
- min: 6 tokens
- mean: 10.08 tokens
- max: 16 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task847_pubmedqa_question_generation
- Dataset: task847_pubmedqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 21 tokens
- mean: 248.66 tokens
- max: 256 tokens
- min: 21 tokens
- mean: 248.78 tokens
- max: 256 tokens
- min: 43 tokens
- mean: 249.11 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task678_ollie_actual_relationship_answer_generation
- Dataset: task678_ollie_actual_relationship_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 20 tokens
- mean: 41.01 tokens
- max: 95 tokens
- min: 19 tokens
- mean: 37.95 tokens
- max: 102 tokens
- min: 18 tokens
- mean: 41.14 tokens
- max: 104 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task290_tellmewhy_question_answerability
- Dataset: task290_tellmewhy_question_answerability
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 37 tokens
- mean: 63.19 tokens
- max: 95 tokens
- min: 36 tokens
- mean: 62.66 tokens
- max: 94 tokens
- min: 37 tokens
- mean: 63.44 tokens
- max: 95 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task575_air_dialogue_classification
- Dataset: task575_air_dialogue_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 14.16 tokens
- max: 45 tokens
- min: 4 tokens
- mean: 13.55 tokens
- max: 43 tokens
- min: 4 tokens
- mean: 12.3 tokens
- max: 42 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task189_snli_neutral_to_contradiction_text_modification
- Dataset: task189_snli_neutral_to_contradiction_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 31.82 tokens
- max: 60 tokens
- min: 18 tokens
- mean: 30.75 tokens
- max: 57 tokens
- min: 18 tokens
- mean: 33.25 tokens
- max: 105 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task026_drop_question_generation
- Dataset: task026_drop_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 82 tokens
- mean: 219.39 tokens
- max: 256 tokens
- min: 57 tokens
- mean: 222.63 tokens
- max: 256 tokens
- min: 96 tokens
- mean: 232.08 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task162_count_words_starting_with_letter
- Dataset: task162_count_words_starting_with_letter
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 28 tokens
- mean: 32.21 tokens
- max: 56 tokens
- min: 28 tokens
- mean: 31.77 tokens
- max: 45 tokens
- min: 28 tokens
- mean: 31.64 tokens
- max: 46 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task079_conala_concat_strings
- Dataset: task079_conala_concat_strings
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 11 tokens
- mean: 39.62 tokens
- max: 76 tokens
- min: 11 tokens
- mean: 34.2 tokens
- max: 80 tokens
- min: 11 tokens
- mean: 33.53 tokens
- max: 76 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task610_conllpp_ner
- Dataset: task610_conllpp_ner
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 19.55 tokens
- max: 62 tokens
- min: 4 tokens
- mean: 20.27 tokens
- max: 62 tokens
- min: 4 tokens
- mean: 14.12 tokens
- max: 54 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task046_miscellaneous_question_typing
- Dataset: task046_miscellaneous_question_typing
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 25.41 tokens
- max: 70 tokens
- min: 16 tokens
- mean: 24.94 tokens
- max: 70 tokens
- min: 16 tokens
- mean: 25.13 tokens
- max: 57 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task197_mnli_domain_answer_generation
- Dataset: task197_mnli_domain_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 15 tokens
- mean: 44.09 tokens
- max: 197 tokens
- min: 12 tokens
- mean: 44.97 tokens
- max: 211 tokens
- min: 11 tokens
- mean: 39.22 tokens
- max: 115 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1325_qa_zre_question_generation_on_subject_relation
- Dataset: task1325_qa_zre_question_generation_on_subject_relation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 51.02 tokens
- max: 256 tokens
- min: 20 tokens
- mean: 49.57 tokens
- max: 180 tokens
- min: 22 tokens
- mean: 54.59 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task430_senteval_subject_count
- Dataset: task430_senteval_subject_count
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 17.14 tokens
- max: 35 tokens
- min: 7 tokens
- mean: 15.31 tokens
- max: 34 tokens
- min: 7 tokens
- mean: 16.13 tokens
- max: 34 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task672_nummersense
- Dataset: task672_nummersense
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 15.72 tokens
- max: 30 tokens
- min: 7 tokens
- mean: 15.33 tokens
- max: 27 tokens
- min: 7 tokens
- mean: 15.21 tokens
- max: 30 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task402_grailqa_paraphrase_generation
- Dataset: task402_grailqa_paraphrase_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 23 tokens
- mean: 127.55 tokens
- max: 256 tokens
- min: 24 tokens
- mean: 139.34 tokens
- max: 256 tokens
- min: 22 tokens
- mean: 133.69 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task904_hate_speech_offensive_classification
- Dataset: task904_hate_speech_offensive_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 35.03 tokens
- max: 157 tokens
- min: 8 tokens
- mean: 34.67 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 27.84 tokens
- max: 148 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task192_hotpotqa_sentence_generation
- Dataset: task192_hotpotqa_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 37 tokens
- mean: 125.55 tokens
- max: 256 tokens
- min: 35 tokens
- mean: 123.85 tokens
- max: 256 tokens
- min: 33 tokens
- mean: 134.16 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task069_abductivenli_classification
- Dataset: task069_abductivenli_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 33 tokens
- mean: 52.09 tokens
- max: 86 tokens
- min: 33 tokens
- mean: 52.16 tokens
- max: 95 tokens
- min: 33 tokens
- mean: 51.84 tokens
- max: 95 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task574_air_dialogue_sentence_generation
- Dataset: task574_air_dialogue_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 54 tokens
- mean: 143.98 tokens
- max: 256 tokens
- min: 57 tokens
- mean: 143.52 tokens
- max: 256 tokens
- min: 66 tokens
- mean: 147.45 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task187_snli_entailment_to_contradiction_text_modification
- Dataset: task187_snli_entailment_to_contradiction_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 30.23 tokens
- max: 69 tokens
- min: 16 tokens
- mean: 29.82 tokens
- max: 104 tokens
- min: 17 tokens
- mean: 29.44 tokens
- max: 71 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task749_glucose_reverse_cause_emotion_detection
- Dataset: task749_glucose_reverse_cause_emotion_detection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 38 tokens
- mean: 67.61 tokens
- max: 106 tokens
- min: 37 tokens
- mean: 67.14 tokens
- max: 104 tokens
- min: 39 tokens
- mean: 68.46 tokens
- max: 107 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1552_scitail_question_generation
- Dataset: task1552_scitail_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 18.37 tokens
- max: 53 tokens
- min: 7 tokens
- mean: 17.55 tokens
- max: 46 tokens
- min: 7 tokens
- mean: 15.88 tokens
- max: 54 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task750_aqua_multiple_choice_answering
- Dataset: task750_aqua_multiple_choice_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 33 tokens
- mean: 69.62 tokens
- max: 194 tokens
- min: 32 tokens
- mean: 67.98 tokens
- max: 194 tokens
- min: 28 tokens
- mean: 67.81 tokens
- max: 165 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task327_jigsaw_classification_toxic
- Dataset: task327_jigsaw_classification_toxic
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 36.8 tokens
- max: 234 tokens
- min: 5 tokens
- mean: 40.85 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 45.53 tokens
- max: 244 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1502_hatexplain_classification
- Dataset: task1502_hatexplain_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 28.69 tokens
- max: 73 tokens
- min: 5 tokens
- mean: 26.7 tokens
- max: 110 tokens
- min: 5 tokens
- mean: 26.92 tokens
- max: 90 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task328_jigsaw_classification_insult
- Dataset: task328_jigsaw_classification_insult
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 51.02 tokens
- max: 247 tokens
- min: 5 tokens
- mean: 60.56 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 64.19 tokens
- max: 249 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task304_numeric_fused_head_resolution
- Dataset: task304_numeric_fused_head_resolution
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 15 tokens
- mean: 120.75 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 122.1 tokens
- max: 256 tokens
- min: 11 tokens
- mean: 134.06 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1293_kilt_tasks_hotpotqa_question_answering
- Dataset: task1293_kilt_tasks_hotpotqa_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 24.78 tokens
- max: 114 tokens
- min: 9 tokens
- mean: 24.2 tokens
- max: 114 tokens
- min: 8 tokens
- mean: 23.85 tokens
- max: 84 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task216_rocstories_correct_answer_generation
- Dataset: task216_rocstories_correct_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 39 tokens
- mean: 59.5 tokens
- max: 83 tokens
- min: 36 tokens
- mean: 58.38 tokens
- max: 92 tokens
- min: 39 tokens
- mean: 58.22 tokens
- max: 95 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1326_qa_zre_question_generation_from_answer
- Dataset: task1326_qa_zre_question_generation_from_answer
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 46.37 tokens
- max: 256 tokens
- min: 14 tokens
- mean: 45.05 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 49.47 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1338_peixian_equity_evaluation_corpus_sentiment_classifier
- Dataset: task1338_peixian_equity_evaluation_corpus_sentiment_classifier
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 9.68 tokens
- max: 16 tokens
- min: 6 tokens
- mean: 9.71 tokens
- max: 16 tokens
- min: 6 tokens
- mean: 9.57 tokens
- max: 17 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1729_personachat_generate_next
- Dataset: task1729_personachat_generate_next
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 44 tokens
- mean: 146.46 tokens
- max: 256 tokens
- min: 43 tokens
- mean: 142.09 tokens
- max: 256 tokens
- min: 50 tokens
- mean: 144.22 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1202_atomic_classification_xneed
- Dataset: task1202_atomic_classification_xneed
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 19.55 tokens
- max: 32 tokens
- min: 14 tokens
- mean: 19.39 tokens
- max: 31 tokens
- min: 14 tokens
- mean: 19.22 tokens
- max: 28 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task400_paws_paraphrase_classification
- Dataset: task400_paws_paraphrase_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 19 tokens
- mean: 52.28 tokens
- max: 97 tokens
- min: 18 tokens
- mean: 51.88 tokens
- max: 98 tokens
- min: 19 tokens
- mean: 53.03 tokens
- max: 97 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task502_scruples_anecdotes_whoiswrong_verification
- Dataset: task502_scruples_anecdotes_whoiswrong_verification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 229.76 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 236.43 tokens
- max: 256 tokens
- min: 23 tokens
- mean: 235.02 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task088_identify_typo_verification
- Dataset: task088_identify_typo_verification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 11 tokens
- mean: 15.08 tokens
- max: 48 tokens
- min: 10 tokens
- mean: 15.05 tokens
- max: 47 tokens
- min: 10 tokens
- mean: 15.39 tokens
- max: 47 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task221_rocstories_two_choice_classification
- Dataset: task221_rocstories_two_choice_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 47 tokens
- mean: 72.64 tokens
- max: 108 tokens
- min: 48 tokens
- mean: 72.66 tokens
- max: 109 tokens
- min: 46 tokens
- mean: 73.26 tokens
- max: 108 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task200_mnli_entailment_classification
- Dataset: task200_mnli_entailment_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 72.63 tokens
- max: 198 tokens
- min: 23 tokens
- mean: 72.69 tokens
- max: 224 tokens
- min: 23 tokens
- mean: 73.44 tokens
- max: 226 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task074_squad1.1_question_generation
- Dataset: task074_squad1.1_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 30 tokens
- mean: 150.23 tokens
- max: 256 tokens
- min: 33 tokens
- mean: 160.48 tokens
- max: 256 tokens
- min: 38 tokens
- mean: 164.59 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task581_socialiqa_question_generation
- Dataset: task581_socialiqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 26.52 tokens
- max: 69 tokens
- min: 14 tokens
- mean: 25.55 tokens
- max: 48 tokens
- min: 15 tokens
- mean: 25.85 tokens
- max: 48 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1186_nne_hrngo_classification
- Dataset: task1186_nne_hrngo_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 19 tokens
- mean: 33.82 tokens
- max: 79 tokens
- min: 19 tokens
- mean: 33.49 tokens
- max: 74 tokens
- min: 20 tokens
- mean: 33.34 tokens
- max: 77 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task898_freebase_qa_answer_generation
- Dataset: task898_freebase_qa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 19.18 tokens
- max: 125 tokens
- min: 8 tokens
- mean: 17.45 tokens
- max: 49 tokens
- min: 8 tokens
- mean: 17.48 tokens
- max: 79 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1408_dart_similarity_classification
- Dataset: task1408_dart_similarity_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 22 tokens
- mean: 59.48 tokens
- max: 147 tokens
- min: 22 tokens
- mean: 61.95 tokens
- max: 154 tokens
- min: 20 tokens
- mean: 48.32 tokens
- max: 124 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task168_strategyqa_question_decomposition
- Dataset: task168_strategyqa_question_decomposition
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 42 tokens
- mean: 81.83 tokens
- max: 181 tokens
- min: 42 tokens
- mean: 79.75 tokens
- max: 179 tokens
- min: 42 tokens
- mean: 77.43 tokens
- max: 166 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1357_xlsum_summary_generation
- Dataset: task1357_xlsum_summary_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 67 tokens
- mean: 242.04 tokens
- max: 256 tokens
- min: 76 tokens
- mean: 243.28 tokens
- max: 256 tokens
- min: 67 tokens
- mean: 247.07 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task390_torque_text_span_selection
- Dataset: task390_torque_text_span_selection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 47 tokens
- mean: 110.04 tokens
- max: 196 tokens
- min: 42 tokens
- mean: 110.49 tokens
- max: 195 tokens
- min: 48 tokens
- mean: 110.67 tokens
- max: 196 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task165_mcscript_question_answering_commonsense
- Dataset: task165_mcscript_question_answering_commonsense
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 147 tokens
- mean: 198.24 tokens
- max: 256 tokens
- min: 145 tokens
- mean: 196.67 tokens
- max: 256 tokens
- min: 147 tokens
- mean: 198.41 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1533_daily_dialog_formal_classification
- Dataset: task1533_daily_dialog_formal_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 129.55 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 136.75 tokens
- max: 256 tokens
- min: 17 tokens
- mean: 137.33 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task002_quoref_answer_generation
- Dataset: task002_quoref_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 214 tokens
- mean: 255.54 tokens
- max: 256 tokens
- min: 214 tokens
- mean: 255.53 tokens
- max: 256 tokens
- min: 224 tokens
- mean: 255.61 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1297_qasc_question_answering
- Dataset: task1297_qasc_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 61 tokens
- mean: 84.69 tokens
- max: 134 tokens
- min: 59 tokens
- mean: 85.39 tokens
- max: 130 tokens
- min: 58 tokens
- mean: 84.83 tokens
- max: 125 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task305_jeopardy_answer_generation_normal
- Dataset: task305_jeopardy_answer_generation_normal
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 27.72 tokens
- max: 59 tokens
- min: 9 tokens
- mean: 27.43 tokens
- max: 45 tokens
- min: 11 tokens
- mean: 27.37 tokens
- max: 46 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task029_winogrande_full_object
- Dataset: task029_winogrande_full_object
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 7.37 tokens
- max: 12 tokens
- min: 7 tokens
- mean: 7.32 tokens
- max: 11 tokens
- min: 7 tokens
- mean: 7.24 tokens
- max: 10 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1327_qa_zre_answer_generation_from_question
- Dataset: task1327_qa_zre_answer_generation_from_question
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 55.0 tokens
- max: 256 tokens
- min: 23 tokens
- mean: 52.2 tokens
- max: 256 tokens
- min: 27 tokens
- mean: 55.59 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task326_jigsaw_classification_obscene
- Dataset: task326_jigsaw_classification_obscene
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 65.45 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 77.38 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 74.07 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1542_every_ith_element_from_starting
- Dataset: task1542_every_ith_element_from_starting
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 125.21 tokens
- max: 245 tokens
- min: 13 tokens
- mean: 123.54 tokens
- max: 244 tokens
- min: 13 tokens
- mean: 120.48 tokens
- max: 238 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task570_recipe_nlg_ner_generation
- Dataset: task570_recipe_nlg_ner_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 74.07 tokens
- max: 250 tokens
- min: 5 tokens
- mean: 73.6 tokens
- max: 256 tokens
- min: 8 tokens
- mean: 76.08 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1409_dart_text_generation
- Dataset: task1409_dart_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 67.5 tokens
- max: 174 tokens
- min: 18 tokens
- mean: 72.52 tokens
- max: 170 tokens
- min: 17 tokens
- mean: 67.55 tokens
- max: 164 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task401_numeric_fused_head_reference
- Dataset: task401_numeric_fused_head_reference
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 109.08 tokens
- max: 256 tokens
- min: 16 tokens
- mean: 116.35 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 119.65 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task846_pubmedqa_classification
- Dataset: task846_pubmedqa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 32 tokens
- mean: 85.83 tokens
- max: 246 tokens
- min: 33 tokens
- mean: 85.03 tokens
- max: 225 tokens
- min: 28 tokens
- mean: 93.96 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1712_poki_classification
- Dataset: task1712_poki_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 52.73 tokens
- max: 256 tokens
- min: 7 tokens
- mean: 55.65 tokens
- max: 256 tokens
- min: 7 tokens
- mean: 63.01 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task344_hybridqa_answer_generation
- Dataset: task344_hybridqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 22.15 tokens
- max: 50 tokens
- min: 8 tokens
- mean: 22.07 tokens
- max: 58 tokens
- min: 7 tokens
- mean: 22.07 tokens
- max: 55 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task875_emotion_classification
- Dataset: task875_emotion_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 23.03 tokens
- max: 75 tokens
- min: 4 tokens
- mean: 18.42 tokens
- max: 63 tokens
- min: 5 tokens
- mean: 20.36 tokens
- max: 68 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1214_atomic_classification_xwant
- Dataset: task1214_atomic_classification_xwant
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 19.66 tokens
- max: 32 tokens
- min: 14 tokens
- mean: 19.39 tokens
- max: 29 tokens
- min: 14 tokens
- mean: 19.57 tokens
- max: 31 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task106_scruples_ethical_judgment
- Dataset: task106_scruples_ethical_judgment
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 29.85 tokens
- max: 70 tokens
- min: 14 tokens
- mean: 28.96 tokens
- max: 86 tokens
- min: 14 tokens
- mean: 28.77 tokens
- max: 58 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task238_iirc_answer_from_passage_answer_generation
- Dataset: task238_iirc_answer_from_passage_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 138 tokens
- mean: 242.59 tokens
- max: 256 tokens
- min: 165 tokens
- mean: 242.86 tokens
- max: 256 tokens
- min: 173 tokens
- mean: 243.06 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1391_winogrande_easy_answer_generation
- Dataset: task1391_winogrande_easy_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 26 tokens
- mean: 31.69 tokens
- max: 54 tokens
- min: 26 tokens
- mean: 31.28 tokens
- max: 48 tokens
- min: 25 tokens
- mean: 31.16 tokens
- max: 49 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task195_sentiment140_classification
- Dataset: task195_sentiment140_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 22.62 tokens
- max: 118 tokens
- min: 4 tokens
- mean: 18.82 tokens
- max: 79 tokens
- min: 5 tokens
- mean: 21.32 tokens
- max: 51 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task163_count_words_ending_with_letter
- Dataset: task163_count_words_ending_with_letter
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 28 tokens
- mean: 32.06 tokens
- max: 54 tokens
- min: 28 tokens
- mean: 31.69 tokens
- max: 57 tokens
- min: 28 tokens
- mean: 31.58 tokens
- max: 43 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task579_socialiqa_classification
- Dataset: task579_socialiqa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 39 tokens
- mean: 54.2 tokens
- max: 132 tokens
- min: 36 tokens
- mean: 53.61 tokens
- max: 103 tokens
- min: 40 tokens
- mean: 54.16 tokens
- max: 84 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task569_recipe_nlg_text_generation
- Dataset: task569_recipe_nlg_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 25 tokens
- mean: 193.73 tokens
- max: 256 tokens
- min: 55 tokens
- mean: 193.64 tokens
- max: 256 tokens
- min: 37 tokens
- mean: 198.12 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1602_webquestion_question_genreation
- Dataset: task1602_webquestion_question_genreation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 23.64 tokens
- max: 112 tokens
- min: 12 tokens
- mean: 24.12 tokens
- max: 112 tokens
- min: 12 tokens
- mean: 22.49 tokens
- max: 120 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task747_glucose_cause_emotion_detection
- Dataset: task747_glucose_cause_emotion_detection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 35 tokens
- mean: 68.15 tokens
- max: 112 tokens
- min: 36 tokens
- mean: 68.3 tokens
- max: 108 tokens
- min: 36 tokens
- mean: 68.79 tokens
- max: 99 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task219_rocstories_title_answer_generation
- Dataset: task219_rocstories_title_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 42 tokens
- mean: 67.71 tokens
- max: 97 tokens
- min: 45 tokens
- mean: 66.7 tokens
- max: 97 tokens
- min: 41 tokens
- mean: 66.92 tokens
- max: 96 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task178_quartz_question_answering
- Dataset: task178_quartz_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 28 tokens
- mean: 57.78 tokens
- max: 110 tokens
- min: 28 tokens
- mean: 57.44 tokens
- max: 111 tokens
- min: 28 tokens
- mean: 56.86 tokens
- max: 102 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task103_facts2story_long_text_generation
- Dataset: task103_facts2story_long_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 52 tokens
- mean: 80.49 tokens
- max: 143 tokens
- min: 51 tokens
- mean: 82.22 tokens
- max: 157 tokens
- min: 49 tokens
- mean: 78.96 tokens
- max: 145 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task301_record_question_generation
- Dataset: task301_record_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 140 tokens
- mean: 210.71 tokens
- max: 256 tokens
- min: 139 tokens
- mean: 209.62 tokens
- max: 256 tokens
- min: 143 tokens
- mean: 208.74 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1369_healthfact_sentence_generation
- Dataset: task1369_healthfact_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 110 tokens
- mean: 243.25 tokens
- max: 256 tokens
- min: 101 tokens
- mean: 243.17 tokens
- max: 256 tokens
- min: 113 tokens
- mean: 251.67 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task515_senteval_odd_word_out
- Dataset: task515_senteval_odd_word_out
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 19.72 tokens
- max: 36 tokens
- min: 7 tokens
- mean: 19.13 tokens
- max: 38 tokens
- min: 7 tokens
- mean: 19.0 tokens
- max: 35 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task496_semeval_answer_generation
- Dataset: task496_semeval_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 28.11 tokens
- max: 46 tokens
- min: 18 tokens
- mean: 27.8 tokens
- max: 45 tokens
- min: 19 tokens
- mean: 27.68 tokens
- max: 45 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1658_billsum_summarization
- Dataset: task1658_billsum_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1204_atomic_classification_hinderedby
- Dataset: task1204_atomic_classification_hinderedby
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 22.1 tokens
- max: 35 tokens
- min: 14 tokens
- mean: 22.07 tokens
- max: 34 tokens
- min: 14 tokens
- mean: 21.5 tokens
- max: 38 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1392_superglue_multirc_answer_verification
- Dataset: task1392_superglue_multirc_answer_verification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 128 tokens
- mean: 241.77 tokens
- max: 256 tokens
- min: 127 tokens
- mean: 241.97 tokens
- max: 256 tokens
- min: 136 tokens
- mean: 242.04 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task306_jeopardy_answer_generation_double
- Dataset: task306_jeopardy_answer_generation_double
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 27.79 tokens
- max: 47 tokens
- min: 10 tokens
- mean: 27.16 tokens
- max: 46 tokens
- min: 11 tokens
- mean: 27.61 tokens
- max: 47 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1286_openbookqa_question_answering
- Dataset: task1286_openbookqa_question_answering
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 22 tokens
- mean: 39.54 tokens
- max: 85 tokens
- min: 23 tokens
- mean: 38.94 tokens
- max: 96 tokens
- min: 22 tokens
- mean: 38.26 tokens
- max: 89 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task159_check_frequency_of_words_in_sentence_pair
- Dataset: task159_check_frequency_of_words_in_sentence_pair
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 44 tokens
- mean: 50.37 tokens
- max: 67 tokens
- min: 44 tokens
- mean: 50.35 tokens
- max: 67 tokens
- min: 44 tokens
- mean: 50.61 tokens
- max: 66 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task151_tomqa_find_location_easy_clean
- Dataset: task151_tomqa_find_location_easy_clean
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 37 tokens
- mean: 50.73 tokens
- max: 79 tokens
- min: 37 tokens
- mean: 50.28 tokens
- max: 74 tokens
- min: 37 tokens
- mean: 50.52 tokens
- max: 74 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task323_jigsaw_classification_sexually_explicit
- Dataset: task323_jigsaw_classification_sexually_explicit
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 66.26 tokens
- max: 248 tokens
- min: 5 tokens
- mean: 76.73 tokens
- max: 248 tokens
- min: 6 tokens
- mean: 75.5 tokens
- max: 251 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task037_qasc_generate_related_fact
- Dataset: task037_qasc_generate_related_fact
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 22.04 tokens
- max: 50 tokens
- min: 13 tokens
- mean: 22.03 tokens
- max: 42 tokens
- min: 13 tokens
- mean: 21.9 tokens
- max: 40 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task027_drop_answer_type_generation
- Dataset: task027_drop_answer_type_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 87 tokens
- mean: 229.02 tokens
- max: 256 tokens
- min: 74 tokens
- mean: 230.67 tokens
- max: 256 tokens
- min: 71 tokens
- mean: 232.43 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1596_event2mind_text_generation_2
- Dataset: task1596_event2mind_text_generation_2
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 9.97 tokens
- max: 18 tokens
- min: 6 tokens
- mean: 10.03 tokens
- max: 19 tokens
- min: 6 tokens
- mean: 10.06 tokens
- max: 18 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task141_odd-man-out_classification_category
- Dataset: task141_odd-man-out_classification_category
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 18.45 tokens
- max: 28 tokens
- min: 16 tokens
- mean: 18.38 tokens
- max: 26 tokens
- min: 16 tokens
- mean: 18.46 tokens
- max: 25 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task194_duorc_answer_generation
- Dataset: task194_duorc_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 149 tokens
- mean: 251.76 tokens
- max: 256 tokens
- min: 147 tokens
- mean: 252.05 tokens
- max: 256 tokens
- min: 148 tokens
- mean: 251.76 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task679_hope_edi_english_text_classification
- Dataset: task679_hope_edi_english_text_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 27.77 tokens
- max: 199 tokens
- min: 4 tokens
- mean: 27.23 tokens
- max: 205 tokens
- min: 5 tokens
- mean: 29.87 tokens
- max: 194 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task246_dream_question_generation
- Dataset: task246_dream_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 80.33 tokens
- max: 256 tokens
- min: 14 tokens
- mean: 80.74 tokens
- max: 256 tokens
- min: 15 tokens
- mean: 87.22 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1195_disflqa_disfluent_to_fluent_conversion
- Dataset: task1195_disflqa_disfluent_to_fluent_conversion
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 19.76 tokens
- max: 41 tokens
- min: 9 tokens
- mean: 19.88 tokens
- max: 40 tokens
- min: 2 tokens
- mean: 20.2 tokens
- max: 44 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task065_timetravel_consistent_sentence_classification
- Dataset: task065_timetravel_consistent_sentence_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 55 tokens
- mean: 79.4 tokens
- max: 117 tokens
- min: 51 tokens
- mean: 79.17 tokens
- max: 110 tokens
- min: 53 tokens
- mean: 80.1 tokens
- max: 110 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task351_winomt_classification_gender_identifiability_anti
- Dataset: task351_winomt_classification_gender_identifiability_anti
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 21.76 tokens
- max: 30 tokens
- min: 16 tokens
- mean: 21.66 tokens
- max: 31 tokens
- min: 16 tokens
- mean: 21.78 tokens
- max: 30 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task580_socialiqa_answer_generation
- Dataset: task580_socialiqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 35 tokens
- mean: 52.41 tokens
- max: 107 tokens
- min: 35 tokens
- mean: 51.02 tokens
- max: 86 tokens
- min: 35 tokens
- mean: 50.98 tokens
- max: 87 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task583_udeps_eng_coarse_pos_tagging
- Dataset: task583_udeps_eng_coarse_pos_tagging
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 41.24 tokens
- max: 185 tokens
- min: 12 tokens
- mean: 40.21 tokens
- max: 185 tokens
- min: 12 tokens
- mean: 40.93 tokens
- max: 185 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task202_mnli_contradiction_classification
- Dataset: task202_mnli_contradiction_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 73.7 tokens
- max: 190 tokens
- min: 28 tokens
- mean: 76.06 tokens
- max: 256 tokens
- min: 23 tokens
- mean: 74.56 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task222_rocstories_two_chioce_slotting_classification
- Dataset: task222_rocstories_two_chioce_slotting_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 48 tokens
- mean: 73.06 tokens
- max: 105 tokens
- min: 48 tokens
- mean: 73.24 tokens
- max: 100 tokens
- min: 49 tokens
- mean: 71.71 tokens
- max: 102 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task498_scruples_anecdotes_whoiswrong_classification
- Dataset: task498_scruples_anecdotes_whoiswrong_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 225.8 tokens
- max: 256 tokens
- min: 47 tokens
- mean: 232.86 tokens
- max: 256 tokens
- min: 47 tokens
- mean: 231.22 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task067_abductivenli_answer_generation
- Dataset: task067_abductivenli_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 26.75 tokens
- max: 40 tokens
- min: 14 tokens
- mean: 26.13 tokens
- max: 42 tokens
- min: 15 tokens
- mean: 26.34 tokens
- max: 38 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task616_cola_classification
- Dataset: task616_cola_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 12.16 tokens
- max: 33 tokens
- min: 5 tokens
- mean: 12.05 tokens
- max: 33 tokens
- min: 6 tokens
- mean: 11.96 tokens
- max: 29 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task286_olid_offense_judgment
- Dataset: task286_olid_offense_judgment
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 32.85 tokens
- max: 145 tokens
- min: 5 tokens
- mean: 30.81 tokens
- max: 171 tokens
- min: 5 tokens
- mean: 30.26 tokens
- max: 169 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task188_snli_neutral_to_entailment_text_modification
- Dataset: task188_snli_neutral_to_entailment_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 31.55 tokens
- max: 79 tokens
- min: 18 tokens
- mean: 31.31 tokens
- max: 84 tokens
- min: 18 tokens
- mean: 32.91 tokens
- max: 84 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task223_quartz_explanation_generation
- Dataset: task223_quartz_explanation_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 31.46 tokens
- max: 68 tokens
- min: 13 tokens
- mean: 31.8 tokens
- max: 68 tokens
- min: 13 tokens
- mean: 28.95 tokens
- max: 96 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task820_protoqa_answer_generation
- Dataset: task820_protoqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 14.87 tokens
- max: 29 tokens
- min: 7 tokens
- mean: 14.54 tokens
- max: 27 tokens
- min: 6 tokens
- mean: 14.22 tokens
- max: 29 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task196_sentiment140_answer_generation
- Dataset: task196_sentiment140_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 36.26 tokens
- max: 72 tokens
- min: 17 tokens
- mean: 32.85 tokens
- max: 61 tokens
- min: 17 tokens
- mean: 36.27 tokens
- max: 72 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1678_mathqa_answer_selection
- Dataset: task1678_mathqa_answer_selection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 33 tokens
- mean: 70.42 tokens
- max: 177 tokens
- min: 30 tokens
- mean: 68.99 tokens
- max: 146 tokens
- min: 33 tokens
- mean: 69.69 tokens
- max: 160 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task349_squad2.0_answerable_unanswerable_question_classification
- Dataset: task349_squad2.0_answerable_unanswerable_question_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 53 tokens
- mean: 176.83 tokens
- max: 256 tokens
- min: 57 tokens
- mean: 177.07 tokens
- max: 256 tokens
- min: 53 tokens
- mean: 176.78 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task154_tomqa_find_location_hard_noise
- Dataset: task154_tomqa_find_location_hard_noise
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 129 tokens
- mean: 176.29 tokens
- max: 253 tokens
- min: 126 tokens
- mean: 176.3 tokens
- max: 249 tokens
- min: 128 tokens
- mean: 178.24 tokens
- max: 254 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task333_hateeval_classification_hate_en
- Dataset: task333_hateeval_classification_hate_en
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 38.33 tokens
- max: 117 tokens
- min: 7 tokens
- mean: 36.79 tokens
- max: 109 tokens
- min: 7 tokens
- mean: 36.61 tokens
- max: 113 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task235_iirc_question_from_subtext_answer_generation
- Dataset: task235_iirc_question_from_subtext_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 52.9 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 50.44 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 55.89 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1554_scitail_classification
- Dataset: task1554_scitail_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 16.8 tokens
- max: 38 tokens
- min: 7 tokens
- mean: 25.75 tokens
- max: 68 tokens
- min: 7 tokens
- mean: 24.34 tokens
- max: 59 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task210_logic2text_structured_text_generation
- Dataset: task210_logic2text_structured_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 31.88 tokens
- max: 101 tokens
- min: 13 tokens
- mean: 30.88 tokens
- max: 94 tokens
- min: 12 tokens
- mean: 32.75 tokens
- max: 89 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task035_winogrande_question_modification_person
- Dataset: task035_winogrande_question_modification_person
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 31 tokens
- mean: 36.16 tokens
- max: 50 tokens
- min: 31 tokens
- mean: 35.75 tokens
- max: 55 tokens
- min: 31 tokens
- mean: 35.41 tokens
- max: 48 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task230_iirc_passage_classification
- Dataset: task230_iirc_passage_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- min: 256 tokens
- mean: 256.0 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1356_xlsum_title_generation
- Dataset: task1356_xlsum_title_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 59 tokens
- mean: 239.92 tokens
- max: 256 tokens
- min: 58 tokens
- mean: 240.94 tokens
- max: 256 tokens
- min: 64 tokens
- mean: 248.75 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1726_mathqa_correct_answer_generation
- Dataset: task1726_mathqa_correct_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 43.81 tokens
- max: 156 tokens
- min: 12 tokens
- mean: 42.63 tokens
- max: 129 tokens
- min: 11 tokens
- mean: 42.82 tokens
- max: 133 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task302_record_classification
- Dataset: task302_record_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 194 tokens
- mean: 253.35 tokens
- max: 256 tokens
- min: 198 tokens
- mean: 252.85 tokens
- max: 256 tokens
- min: 195 tokens
- mean: 252.78 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task380_boolq_yes_no_question
- Dataset: task380_boolq_yes_no_question
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 26 tokens
- mean: 134.17 tokens
- max: 256 tokens
- min: 26 tokens
- mean: 138.56 tokens
- max: 256 tokens
- min: 27 tokens
- mean: 138.25 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task212_logic2text_classification
- Dataset: task212_logic2text_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 33.28 tokens
- max: 146 tokens
- min: 14 tokens
- mean: 32.14 tokens
- max: 146 tokens
- min: 14 tokens
- mean: 32.96 tokens
- max: 127 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task748_glucose_reverse_cause_event_detection
- Dataset: task748_glucose_reverse_cause_event_detection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 35 tokens
- mean: 67.63 tokens
- max: 105 tokens
- min: 38 tokens
- mean: 66.95 tokens
- max: 106 tokens
- min: 39 tokens
- mean: 68.94 tokens
- max: 105 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task834_mathdataset_classification
- Dataset: task834_mathdataset_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 27.7 tokens
- max: 83 tokens
- min: 6 tokens
- mean: 27.88 tokens
- max: 83 tokens
- min: 5 tokens
- mean: 26.97 tokens
- max: 93 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task350_winomt_classification_gender_identifiability_pro
- Dataset: task350_winomt_classification_gender_identifiability_pro
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 21.79 tokens
- max: 30 tokens
- min: 16 tokens
- mean: 21.63 tokens
- max: 30 tokens
- min: 16 tokens
- mean: 21.79 tokens
- max: 30 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task191_hotpotqa_question_generation
- Dataset: task191_hotpotqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 198 tokens
- mean: 255.88 tokens
- max: 256 tokens
- min: 238 tokens
- mean: 255.93 tokens
- max: 256 tokens
- min: 255 tokens
- mean: 256.0 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task236_iirc_question_from_passage_answer_generation
- Dataset: task236_iirc_question_from_passage_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 135 tokens
- mean: 238.3 tokens
- max: 256 tokens
- min: 155 tokens
- mean: 237.61 tokens
- max: 256 tokens
- min: 154 tokens
- mean: 239.64 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task217_rocstories_ordering_answer_generation
- Dataset: task217_rocstories_ordering_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 45 tokens
- mean: 72.32 tokens
- max: 107 tokens
- min: 48 tokens
- mean: 72.29 tokens
- max: 107 tokens
- min: 48 tokens
- mean: 70.87 tokens
- max: 105 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task568_circa_question_generation
- Dataset: task568_circa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 9.6 tokens
- max: 25 tokens
- min: 4 tokens
- mean: 9.46 tokens
- max: 20 tokens
- min: 4 tokens
- mean: 8.93 tokens
- max: 20 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task614_glucose_cause_event_detection
- Dataset: task614_glucose_cause_event_detection
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 39 tokens
- mean: 67.66 tokens
- max: 102 tokens
- min: 39 tokens
- mean: 67.16 tokens
- max: 106 tokens
- min: 38 tokens
- mean: 68.48 tokens
- max: 103 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task361_spolin_yesand_prompt_response_classification
- Dataset: task361_spolin_yesand_prompt_response_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 47.01 tokens
- max: 137 tokens
- min: 17 tokens
- mean: 46.18 tokens
- max: 119 tokens
- min: 17 tokens
- mean: 47.2 tokens
- max: 128 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task421_persent_sentence_sentiment_classification
- Dataset: task421_persent_sentence_sentiment_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 22 tokens
- mean: 67.77 tokens
- max: 256 tokens
- min: 22 tokens
- mean: 71.21 tokens
- max: 256 tokens
- min: 19 tokens
- mean: 72.24 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task203_mnli_sentence_generation
- Dataset: task203_mnli_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 38.73 tokens
- max: 175 tokens
- min: 14 tokens
- mean: 35.74 tokens
- max: 175 tokens
- min: 13 tokens
- mean: 34.18 tokens
- max: 170 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task420_persent_document_sentiment_classification
- Dataset: task420_persent_document_sentiment_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 22 tokens
- mean: 224.14 tokens
- max: 256 tokens
- min: 22 tokens
- mean: 233.63 tokens
- max: 256 tokens
- min: 22 tokens
- mean: 227.59 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task153_tomqa_find_location_hard_clean
- Dataset: task153_tomqa_find_location_hard_clean
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 39 tokens
- mean: 160.13 tokens
- max: 256 tokens
- min: 39 tokens
- mean: 159.86 tokens
- max: 256 tokens
- min: 39 tokens
- mean: 162.75 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task346_hybridqa_classification
- Dataset: task346_hybridqa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 32.87 tokens
- max: 68 tokens
- min: 18 tokens
- mean: 31.92 tokens
- max: 63 tokens
- min: 19 tokens
- mean: 31.83 tokens
- max: 75 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1211_atomic_classification_hassubevent
- Dataset: task1211_atomic_classification_hassubevent
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 11 tokens
- mean: 16.25 tokens
- max: 31 tokens
- min: 11 tokens
- mean: 16.02 tokens
- max: 29 tokens
- min: 11 tokens
- mean: 16.89 tokens
- max: 29 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task360_spolin_yesand_response_generation
- Dataset: task360_spolin_yesand_response_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 22.54 tokens
- max: 89 tokens
- min: 6 tokens
- mean: 21.16 tokens
- max: 92 tokens
- min: 7 tokens
- mean: 20.91 tokens
- max: 67 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task510_reddit_tifu_title_summarization
- Dataset: task510_reddit_tifu_title_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 217.53 tokens
- max: 256 tokens
- min: 20 tokens
- mean: 218.59 tokens
- max: 256 tokens
- min: 10 tokens
- mean: 221.41 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task511_reddit_tifu_long_text_summarization
- Dataset: task511_reddit_tifu_long_text_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 29 tokens
- mean: 239.72 tokens
- max: 256 tokens
- min: 76 tokens
- mean: 238.38 tokens
- max: 256 tokens
- min: 43 tokens
- mean: 245.03 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task345_hybridqa_answer_generation
- Dataset: task345_hybridqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 22.14 tokens
- max: 50 tokens
- min: 10 tokens
- mean: 21.6 tokens
- max: 70 tokens
- min: 8 tokens
- mean: 20.96 tokens
- max: 47 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task270_csrg_counterfactual_context_generation
- Dataset: task270_csrg_counterfactual_context_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 63 tokens
- mean: 100.05 tokens
- max: 158 tokens
- min: 63 tokens
- mean: 98.61 tokens
- max: 142 tokens
- min: 62 tokens
- mean: 100.35 tokens
- max: 141 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task307_jeopardy_answer_generation_final
- Dataset: task307_jeopardy_answer_generation_final
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 15 tokens
- mean: 29.61 tokens
- max: 46 tokens
- min: 15 tokens
- mean: 29.31 tokens
- max: 53 tokens
- min: 15 tokens
- mean: 29.28 tokens
- max: 43 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task001_quoref_question_generation
- Dataset: task001_quoref_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 201 tokens
- mean: 254.96 tokens
- max: 256 tokens
- min: 99 tokens
- mean: 254.28 tokens
- max: 256 tokens
- min: 173 tokens
- mean: 255.13 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task089_swap_words_verification
- Dataset: task089_swap_words_verification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 12.86 tokens
- max: 28 tokens
- min: 9 tokens
- mean: 12.64 tokens
- max: 24 tokens
- min: 9 tokens
- mean: 12.26 tokens
- max: 22 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1196_atomic_classification_oeffect
- Dataset: task1196_atomic_classification_oeffect
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 18.79 tokens
- max: 41 tokens
- min: 14 tokens
- mean: 18.57 tokens
- max: 30 tokens
- min: 14 tokens
- mean: 18.51 tokens
- max: 29 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task080_piqa_answer_generation
- Dataset: task080_piqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 10.82 tokens
- max: 33 tokens
- min: 3 tokens
- mean: 10.77 tokens
- max: 24 tokens
- min: 3 tokens
- mean: 10.03 tokens
- max: 26 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1598_nyc_long_text_generation
- Dataset: task1598_nyc_long_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 35.5 tokens
- max: 56 tokens
- min: 17 tokens
- mean: 35.66 tokens
- max: 56 tokens
- min: 20 tokens
- mean: 36.66 tokens
- max: 55 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task240_tweetqa_question_generation
- Dataset: task240_tweetqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 27 tokens
- mean: 51.18 tokens
- max: 94 tokens
- min: 25 tokens
- mean: 50.72 tokens
- max: 92 tokens
- min: 20 tokens
- mean: 51.63 tokens
- max: 95 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task615_moviesqa_answer_generation
- Dataset: task615_moviesqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 11.46 tokens
- max: 23 tokens
- min: 7 tokens
- mean: 11.44 tokens
- max: 19 tokens
- min: 5 tokens
- mean: 11.4 tokens
- max: 22 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1347_glue_sts-b_similarity_classification
- Dataset: task1347_glue_sts-b_similarity_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 17 tokens
- mean: 31.13 tokens
- max: 88 tokens
- min: 16 tokens
- mean: 31.12 tokens
- max: 92 tokens
- min: 16 tokens
- mean: 30.85 tokens
- max: 92 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task114_is_the_given_word_longest
- Dataset: task114_is_the_given_word_longest
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 25 tokens
- mean: 28.87 tokens
- max: 68 tokens
- min: 25 tokens
- mean: 28.46 tokens
- max: 48 tokens
- min: 25 tokens
- mean: 28.7 tokens
- max: 47 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task292_storycommonsense_character_text_generation
- Dataset: task292_storycommonsense_character_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 43 tokens
- mean: 67.87 tokens
- max: 98 tokens
- min: 46 tokens
- mean: 67.11 tokens
- max: 104 tokens
- min: 43 tokens
- mean: 69.05 tokens
- max: 96 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task115_help_advice_classification
- Dataset: task115_help_advice_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 2 tokens
- mean: 19.89 tokens
- max: 91 tokens
- min: 3 tokens
- mean: 18.13 tokens
- max: 92 tokens
- min: 4 tokens
- mean: 19.22 tokens
- max: 137 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task431_senteval_object_count
- Dataset: task431_senteval_object_count
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 16.78 tokens
- max: 37 tokens
- min: 7 tokens
- mean: 15.12 tokens
- max: 36 tokens
- min: 7 tokens
- mean: 15.72 tokens
- max: 35 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1360_numer_sense_multiple_choice_qa_generation
- Dataset: task1360_numer_sense_multiple_choice_qa_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 32 tokens
- mean: 40.62 tokens
- max: 54 tokens
- min: 32 tokens
- mean: 40.3 tokens
- max: 53 tokens
- min: 32 tokens
- mean: 40.28 tokens
- max: 60 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task177_para-nmt_paraphrasing
- Dataset: task177_para-nmt_paraphrasing
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 19.86 tokens
- max: 82 tokens
- min: 9 tokens
- mean: 18.91 tokens
- max: 58 tokens
- min: 9 tokens
- mean: 18.22 tokens
- max: 36 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task132_dais_text_modification
- Dataset: task132_dais_text_modification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 9.3 tokens
- max: 15 tokens
- min: 6 tokens
- mean: 9.08 tokens
- max: 15 tokens
- min: 6 tokens
- mean: 10.11 tokens
- max: 15 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task269_csrg_counterfactual_story_generation
- Dataset: task269_csrg_counterfactual_story_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 49 tokens
- mean: 79.95 tokens
- max: 111 tokens
- min: 53 tokens
- mean: 79.51 tokens
- max: 116 tokens
- min: 48 tokens
- mean: 79.5 tokens
- max: 114 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task233_iirc_link_exists_classification
- Dataset: task233_iirc_link_exists_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 145 tokens
- mean: 235.67 tokens
- max: 256 tokens
- min: 142 tokens
- mean: 233.59 tokens
- max: 256 tokens
- min: 151 tokens
- mean: 235.1 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task161_count_words_containing_letter
- Dataset: task161_count_words_containing_letter
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 27 tokens
- mean: 30.99 tokens
- max: 53 tokens
- min: 27 tokens
- mean: 30.8 tokens
- max: 61 tokens
- min: 27 tokens
- mean: 30.5 tokens
- max: 42 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1205_atomic_classification_isafter
- Dataset: task1205_atomic_classification_isafter
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 20.91 tokens
- max: 37 tokens
- min: 14 tokens
- mean: 20.65 tokens
- max: 35 tokens
- min: 14 tokens
- mean: 21.51 tokens
- max: 37 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task571_recipe_nlg_ner_generation
- Dataset: task571_recipe_nlg_ner_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 118.38 tokens
- max: 256 tokens
- min: 7 tokens
- mean: 118.92 tokens
- max: 256 tokens
- min: 6 tokens
- mean: 111.39 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1292_yelp_review_full_text_categorization
- Dataset: task1292_yelp_review_full_text_categorization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 136.66 tokens
- max: 256 tokens
- min: 7 tokens
- mean: 146.65 tokens
- max: 256 tokens
- min: 3 tokens
- mean: 146.05 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task428_senteval_inversion
- Dataset: task428_senteval_inversion
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 16.69 tokens
- max: 32 tokens
- min: 7 tokens
- mean: 14.58 tokens
- max: 31 tokens
- min: 7 tokens
- mean: 15.26 tokens
- max: 34 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task311_race_question_generation
- Dataset: task311_race_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 115 tokens
- mean: 254.87 tokens
- max: 256 tokens
- min: 137 tokens
- mean: 254.4 tokens
- max: 256 tokens
- min: 171 tokens
- mean: 255.44 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task429_senteval_tense
- Dataset: task429_senteval_tense
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 15.84 tokens
- max: 37 tokens
- min: 6 tokens
- mean: 13.96 tokens
- max: 33 tokens
- min: 7 tokens
- mean: 15.25 tokens
- max: 36 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task403_creak_commonsense_inference
- Dataset: task403_creak_commonsense_inference
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 30.24 tokens
- max: 104 tokens
- min: 13 tokens
- mean: 29.39 tokens
- max: 108 tokens
- min: 13 tokens
- mean: 29.32 tokens
- max: 122 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task929_products_reviews_classification
- Dataset: task929_products_reviews_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 69.68 tokens
- max: 126 tokens
- min: 6 tokens
- mean: 70.66 tokens
- max: 123 tokens
- min: 6 tokens
- mean: 70.61 tokens
- max: 123 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task582_naturalquestion_answer_generation
- Dataset: task582_naturalquestion_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 11.71 tokens
- max: 25 tokens
- min: 10 tokens
- mean: 11.65 tokens
- max: 24 tokens
- min: 10 tokens
- mean: 11.73 tokens
- max: 25 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task237_iirc_answer_from_subtext_answer_generation
- Dataset: task237_iirc_answer_from_subtext_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 22 tokens
- mean: 66.3 tokens
- max: 256 tokens
- min: 25 tokens
- mean: 64.61 tokens
- max: 256 tokens
- min: 23 tokens
- mean: 61.49 tokens
- max: 161 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task050_multirc_answerability
- Dataset: task050_multirc_answerability
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 15 tokens
- mean: 32.3 tokens
- max: 112 tokens
- min: 14 tokens
- mean: 31.56 tokens
- max: 93 tokens
- min: 15 tokens
- mean: 32.13 tokens
- max: 159 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task184_break_generate_question
- Dataset: task184_break_generate_question
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 39.73 tokens
- max: 147 tokens
- min: 13 tokens
- mean: 38.83 tokens
- max: 149 tokens
- min: 13 tokens
- mean: 39.61 tokens
- max: 148 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task669_ambigqa_answer_generation
- Dataset: task669_ambigqa_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 12.94 tokens
- max: 23 tokens
- min: 10 tokens
- mean: 12.88 tokens
- max: 27 tokens
- min: 11 tokens
- mean: 12.76 tokens
- max: 22 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task169_strategyqa_sentence_generation
- Dataset: task169_strategyqa_sentence_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 19 tokens
- mean: 35.21 tokens
- max: 65 tokens
- min: 22 tokens
- mean: 34.25 tokens
- max: 60 tokens
- min: 19 tokens
- mean: 33.3 tokens
- max: 65 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task500_scruples_anecdotes_title_generation
- Dataset: task500_scruples_anecdotes_title_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 225.76 tokens
- max: 256 tokens
- min: 31 tokens
- mean: 233.16 tokens
- max: 256 tokens
- min: 27 tokens
- mean: 235.28 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task241_tweetqa_classification
- Dataset: task241_tweetqa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 31 tokens
- mean: 61.75 tokens
- max: 92 tokens
- min: 36 tokens
- mean: 62.23 tokens
- max: 106 tokens
- min: 31 tokens
- mean: 61.7 tokens
- max: 92 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1345_glue_qqp_question_paraprashing
- Dataset: task1345_glue_qqp_question_paraprashing
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 16.86 tokens
- max: 60 tokens
- min: 6 tokens
- mean: 15.83 tokens
- max: 69 tokens
- min: 6 tokens
- mean: 16.62 tokens
- max: 51 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task218_rocstories_swap_order_answer_generation
- Dataset: task218_rocstories_swap_order_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 48 tokens
- mean: 72.41 tokens
- max: 118 tokens
- min: 48 tokens
- mean: 72.48 tokens
- max: 102 tokens
- min: 47 tokens
- mean: 72.1 tokens
- max: 106 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task613_politifact_text_generation
- Dataset: task613_politifact_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 24.87 tokens
- max: 75 tokens
- min: 7 tokens
- mean: 23.39 tokens
- max: 56 tokens
- min: 5 tokens
- mean: 23.07 tokens
- max: 61 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1167_penn_treebank_coarse_pos_tagging
- Dataset: task1167_penn_treebank_coarse_pos_tagging
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 53.65 tokens
- max: 200 tokens
- min: 16 tokens
- mean: 53.64 tokens
- max: 220 tokens
- min: 16 tokens
- mean: 54.8 tokens
- max: 202 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1422_mathqa_physics
- Dataset: task1422_mathqa_physics
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 34 tokens
- mean: 72.71 tokens
- max: 164 tokens
- min: 38 tokens
- mean: 71.93 tokens
- max: 157 tokens
- min: 39 tokens
- mean: 72.67 tokens
- max: 155 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task247_dream_answer_generation
- Dataset: task247_dream_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 38 tokens
- mean: 160.28 tokens
- max: 256 tokens
- min: 39 tokens
- mean: 159.0 tokens
- max: 256 tokens
- min: 41 tokens
- mean: 167.8 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task199_mnli_classification
- Dataset: task199_mnli_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 13 tokens
- mean: 43.07 tokens
- max: 127 tokens
- min: 11 tokens
- mean: 44.72 tokens
- max: 149 tokens
- min: 11 tokens
- mean: 43.81 tokens
- max: 113 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task164_mcscript_question_answering_text
- Dataset: task164_mcscript_question_answering_text
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 150 tokens
- mean: 200.63 tokens
- max: 256 tokens
- min: 150 tokens
- mean: 200.9 tokens
- max: 256 tokens
- min: 142 tokens
- mean: 200.85 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1541_agnews_classification
- Dataset: task1541_agnews_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 21 tokens
- mean: 53.59 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 53.09 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 53.95 tokens
- max: 161 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task516_senteval_conjoints_inversion
- Dataset: task516_senteval_conjoints_inversion
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 20.33 tokens
- max: 34 tokens
- min: 8 tokens
- mean: 19.01 tokens
- max: 34 tokens
- min: 8 tokens
- mean: 18.96 tokens
- max: 34 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task294_storycommonsense_motiv_text_generation
- Dataset: task294_storycommonsense_motiv_text_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 40.09 tokens
- max: 86 tokens
- min: 14 tokens
- mean: 40.77 tokens
- max: 86 tokens
- min: 14 tokens
- mean: 39.86 tokens
- max: 86 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task501_scruples_anecdotes_post_type_verification
- Dataset: task501_scruples_anecdotes_post_type_verification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 231.55 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 235.21 tokens
- max: 256 tokens
- min: 18 tokens
- mean: 234.47 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task213_rocstories_correct_ending_classification
- Dataset: task213_rocstories_correct_ending_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 62 tokens
- mean: 86.17 tokens
- max: 125 tokens
- min: 60 tokens
- mean: 85.49 tokens
- max: 131 tokens
- min: 59 tokens
- mean: 86.18 tokens
- max: 131 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task821_protoqa_question_generation
- Dataset: task821_protoqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 14.6 tokens
- max: 61 tokens
- min: 5 tokens
- mean: 14.95 tokens
- max: 35 tokens
- min: 5 tokens
- mean: 13.89 tokens
- max: 93 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task493_review_polarity_classification
- Dataset: task493_review_polarity_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 18 tokens
- mean: 100.91 tokens
- max: 256 tokens
- min: 19 tokens
- mean: 107.28 tokens
- max: 256 tokens
- min: 14 tokens
- mean: 113.07 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task308_jeopardy_answer_generation_all
- Dataset: task308_jeopardy_answer_generation_all
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 27.9 tokens
- max: 50 tokens
- min: 10 tokens
- mean: 26.98 tokens
- max: 44 tokens
- min: 9 tokens
- mean: 27.48 tokens
- max: 48 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1595_event2mind_text_generation_1
- Dataset: task1595_event2mind_text_generation_1
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 9.86 tokens
- max: 18 tokens
- min: 6 tokens
- mean: 9.97 tokens
- max: 20 tokens
- min: 6 tokens
- mean: 10.02 tokens
- max: 20 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task040_qasc_question_generation
- Dataset: task040_qasc_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 15.04 tokens
- max: 29 tokens
- min: 7 tokens
- mean: 15.05 tokens
- max: 30 tokens
- min: 8 tokens
- mean: 13.84 tokens
- max: 32 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task231_iirc_link_classification
- Dataset: task231_iirc_link_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 179 tokens
- mean: 246.31 tokens
- max: 256 tokens
- min: 170 tokens
- mean: 245.93 tokens
- max: 256 tokens
- min: 161 tokens
- mean: 247.13 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1727_wiqa_what_is_the_effect
- Dataset: task1727_wiqa_what_is_the_effect
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 44 tokens
- mean: 95.17 tokens
- max: 183 tokens
- min: 44 tokens
- mean: 95.18 tokens
- max: 185 tokens
- min: 43 tokens
- mean: 95.42 tokens
- max: 183 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task578_curiosity_dialogs_answer_generation
- Dataset: task578_curiosity_dialogs_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 10 tokens
- mean: 229.66 tokens
- max: 256 tokens
- min: 118 tokens
- mean: 235.49 tokens
- max: 256 tokens
- min: 12 tokens
- mean: 229.46 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task310_race_classification
- Dataset: task310_race_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 101 tokens
- mean: 254.9 tokens
- max: 256 tokens
- min: 218 tokens
- mean: 255.78 tokens
- max: 256 tokens
- min: 101 tokens
- mean: 254.9 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task309_race_answer_generation
- Dataset: task309_race_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 75 tokens
- mean: 254.99 tokens
- max: 256 tokens
- min: 204 tokens
- mean: 255.6 tokens
- max: 256 tokens
- min: 75 tokens
- mean: 255.19 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task379_agnews_topic_classification
- Dataset: task379_agnews_topic_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 20 tokens
- mean: 54.89 tokens
- max: 193 tokens
- min: 20 tokens
- mean: 54.64 tokens
- max: 175 tokens
- min: 21 tokens
- mean: 54.78 tokens
- max: 187 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task030_winogrande_full_person
- Dataset: task030_winogrande_full_person
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 7.59 tokens
- max: 12 tokens
- min: 7 tokens
- mean: 7.49 tokens
- max: 12 tokens
- min: 7 tokens
- mean: 7.38 tokens
- max: 11 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1540_parsed_pdfs_summarization
- Dataset: task1540_parsed_pdfs_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 188.4 tokens
- max: 256 tokens
- min: 46 tokens
- mean: 190.16 tokens
- max: 256 tokens
- min: 3 tokens
- mean: 192.07 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task039_qasc_find_overlapping_words
- Dataset: task039_qasc_find_overlapping_words
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 16 tokens
- mean: 30.48 tokens
- max: 55 tokens
- min: 16 tokens
- mean: 30.05 tokens
- max: 57 tokens
- min: 16 tokens
- mean: 30.65 tokens
- max: 60 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1206_atomic_classification_isbefore
- Dataset: task1206_atomic_classification_isbefore
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 21.2 tokens
- max: 40 tokens
- min: 14 tokens
- mean: 20.77 tokens
- max: 31 tokens
- min: 14 tokens
- mean: 21.41 tokens
- max: 31 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task157_count_vowels_and_consonants
- Dataset: task157_count_vowels_and_consonants
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 24 tokens
- mean: 28.0 tokens
- max: 41 tokens
- min: 24 tokens
- mean: 27.91 tokens
- max: 41 tokens
- min: 24 tokens
- mean: 28.3 tokens
- max: 39 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task339_record_answer_generation
- Dataset: task339_record_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 171 tokens
- mean: 235.1 tokens
- max: 256 tokens
- min: 171 tokens
- mean: 234.38 tokens
- max: 256 tokens
- min: 171 tokens
- mean: 232.38 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task453_swag_answer_generation
- Dataset: task453_swag_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 9 tokens
- mean: 18.56 tokens
- max: 60 tokens
- min: 9 tokens
- mean: 18.16 tokens
- max: 63 tokens
- min: 9 tokens
- mean: 17.5 tokens
- max: 55 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task848_pubmedqa_classification
- Dataset: task848_pubmedqa_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 21 tokens
- mean: 248.87 tokens
- max: 256 tokens
- min: 21 tokens
- mean: 250.0 tokens
- max: 256 tokens
- min: 84 tokens
- mean: 251.62 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task673_google_wellformed_query_classification
- Dataset: task673_google_wellformed_query_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 11.6 tokens
- max: 27 tokens
- min: 6 tokens
- mean: 11.22 tokens
- max: 24 tokens
- min: 6 tokens
- mean: 11.34 tokens
- max: 22 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task676_ollie_relationship_answer_generation
- Dataset: task676_ollie_relationship_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 29 tokens
- mean: 50.99 tokens
- max: 113 tokens
- min: 29 tokens
- mean: 49.39 tokens
- max: 134 tokens
- min: 30 tokens
- mean: 51.48 tokens
- max: 113 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task268_casehold_legal_answer_generation
- Dataset: task268_casehold_legal_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 235 tokens
- mean: 255.96 tokens
- max: 256 tokens
- min: 156 tokens
- mean: 255.46 tokens
- max: 256 tokens
- min: 226 tokens
- mean: 255.94 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task844_financial_phrasebank_classification
- Dataset: task844_financial_phrasebank_classification
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 14 tokens
- mean: 39.8 tokens
- max: 86 tokens
- min: 13 tokens
- mean: 38.45 tokens
- max: 78 tokens
- min: 15 tokens
- mean: 39.06 tokens
- max: 86 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task330_gap_answer_generation
- Dataset: task330_gap_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 26 tokens
- mean: 106.78 tokens
- max: 256 tokens
- min: 44 tokens
- mean: 108.12 tokens
- max: 256 tokens
- min: 45 tokens
- mean: 110.93 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task595_mocha_answer_generation
- Dataset: task595_mocha_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 44 tokens
- mean: 94.08 tokens
- max: 178 tokens
- min: 21 tokens
- mean: 97.06 tokens
- max: 256 tokens
- min: 19 tokens
- mean: 118.77 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task1285_kpa_keypoint_matching
- Dataset: task1285_kpa_keypoint_matching
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 30 tokens
- mean: 52.36 tokens
- max: 92 tokens
- min: 29 tokens
- mean: 50.14 tokens
- max: 84 tokens
- min: 31 tokens
- mean: 53.21 tokens
- max: 88 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task234_iirc_passage_line_answer_generation
- Dataset: task234_iirc_passage_line_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 143 tokens
- mean: 235.25 tokens
- max: 256 tokens
- min: 155 tokens
- mean: 235.25 tokens
- max: 256 tokens
- min: 146 tokens
- mean: 236.25 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task494_review_polarity_answer_generation
- Dataset: task494_review_polarity_answer_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 106.0 tokens
- max: 256 tokens
- min: 23 tokens
- mean: 112.36 tokens
- max: 256 tokens
- min: 20 tokens
- mean: 112.66 tokens
- max: 249 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task670_ambigqa_question_generation
- Dataset: task670_ambigqa_question_generation
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 11 tokens
- mean: 12.66 tokens
- max: 26 tokens
- min: 11 tokens
- mean: 12.48 tokens
- max: 23 tokens
- min: 11 tokens
- mean: 12.24 tokens
- max: 18 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
task289_gigaword_summarization
- Dataset: task289_gigaword_summarization
- Size: 1,018 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 25 tokens
- mean: 51.53 tokens
- max: 87 tokens
- min: 27 tokens
- mean: 52.0 tokens
- max: 87 tokens
- min: 25 tokens
- mean: 51.44 tokens
- max: 87 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
npr
- Dataset: npr
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 12.74 tokens
- max: 32 tokens
- min: 12 tokens
- mean: 152.32 tokens
- max: 256 tokens
- min: 14 tokens
- mean: 119.75 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
nli
- Dataset: nli
- Size: 49,676 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 21.62 tokens
- max: 108 tokens
- min: 4 tokens
- mean: 12.07 tokens
- max: 50 tokens
- min: 4 tokens
- mean: 12.21 tokens
- max: 44 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
SimpleWiki
- Dataset: SimpleWiki
- Size: 5,070 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 29.35 tokens
- max: 256 tokens
- min: 8 tokens
- mean: 33.94 tokens
- max: 256 tokens
- min: 10 tokens
- mean: 56.42 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
amazon_review_2018
- Dataset: amazon_review_2018
- Size: 99,352 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 11.86 tokens
- max: 33 tokens
- min: 11 tokens
- mean: 88.89 tokens
- max: 256 tokens
- min: 11 tokens
- mean: 70.8 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
ccnews_title_text
- Dataset: ccnews_title_text
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 15.24 tokens
- max: 59 tokens
- min: 21 tokens
- mean: 210.26 tokens
- max: 256 tokens
- min: 20 tokens
- mean: 194.92 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
agnews
- Dataset: agnews
- Size: 44,606 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 11.73 tokens
- max: 38 tokens
- min: 10 tokens
- mean: 39.85 tokens
- max: 256 tokens
- min: 13 tokens
- mean: 45.43 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
xsum
- Dataset: xsum
- Size: 10,140 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 8 tokens
- mean: 27.77 tokens
- max: 58 tokens
- min: 14 tokens
- mean: 226.87 tokens
- max: 256 tokens
- min: 41 tokens
- mean: 232.14 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
msmarco
- Dataset: msmarco
- Size: 173,354 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 9.07 tokens
- max: 25 tokens
- min: 19 tokens
- mean: 82.14 tokens
- max: 237 tokens
- min: 19 tokens
- mean: 80.54 tokens
- max: 252 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
yahoo_answers_title_answer
- Dataset: yahoo_answers_title_answer
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 16.73 tokens
- max: 45 tokens
- min: 5 tokens
- mean: 82.94 tokens
- max: 256 tokens
- min: 7 tokens
- mean: 86.15 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
squad_pairs
- Dataset: squad_pairs
- Size: 24,838 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 14.05 tokens
- max: 38 tokens
- min: 32 tokens
- mean: 153.91 tokens
- max: 256 tokens
- min: 34 tokens
- mean: 162.67 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
wow
- Dataset: wow
- Size: 29,908 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 88.36 tokens
- max: 256 tokens
- min: 100 tokens
- mean: 112.02 tokens
- max: 150 tokens
- min: 83 tokens
- mean: 113.07 tokens
- max: 147 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-amazon_counterfactual-avs_triplets
- Dataset: mteb-amazon_counterfactual-avs_triplets
- Size: 4,055 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 12 tokens
- mean: 27.68 tokens
- max: 137 tokens
- min: 12 tokens
- mean: 26.84 tokens
- max: 137 tokens
- min: 12 tokens
- mean: 26.34 tokens
- max: 91 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-amazon_massive_intent-avs_triplets
- Dataset: mteb-amazon_massive_intent-avs_triplets
- Size: 11,661 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 9.5 tokens
- max: 28 tokens
- min: 3 tokens
- mean: 9.05 tokens
- max: 26 tokens
- min: 3 tokens
- mean: 9.45 tokens
- max: 25 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-amazon_massive_scenario-avs_triplets
- Dataset: mteb-amazon_massive_scenario-avs_triplets
- Size: 11,661 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 9.62 tokens
- max: 39 tokens
- min: 3 tokens
- mean: 9.19 tokens
- max: 29 tokens
- min: 3 tokens
- mean: 9.59 tokens
- max: 24 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-amazon_reviews_multi-avs_triplets
- Dataset: mteb-amazon_reviews_multi-avs_triplets
- Size: 198,192 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 7 tokens
- mean: 49.55 tokens
- max: 256 tokens
- min: 6 tokens
- mean: 49.51 tokens
- max: 256 tokens
- min: 8 tokens
- mean: 48.42 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-banking77-avs_triplets
- Dataset: mteb-banking77-avs_triplets
- Size: 10,139 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 15.81 tokens
- max: 73 tokens
- min: 6 tokens
- mean: 15.77 tokens
- max: 73 tokens
- min: 5 tokens
- mean: 16.1 tokens
- max: 73 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-emotion-avs_triplets
- Dataset: mteb-emotion-avs_triplets
- Size: 16,224 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 5 tokens
- mean: 22.04 tokens
- max: 67 tokens
- min: 5 tokens
- mean: 17.71 tokens
- max: 65 tokens
- min: 5 tokens
- mean: 21.99 tokens
- max: 72 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-imdb-avs_triplets
- Dataset: mteb-imdb-avs_triplets
- Size: 24,839 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 34 tokens
- mean: 207.67 tokens
- max: 256 tokens
- min: 36 tokens
- mean: 223.93 tokens
- max: 256 tokens
- min: 42 tokens
- mean: 206.87 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-mtop_domain-avs_triplets
- Dataset: mteb-mtop_domain-avs_triplets
- Size: 15,715 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 10.27 tokens
- max: 32 tokens
- min: 4 tokens
- mean: 9.62 tokens
- max: 24 tokens
- min: 4 tokens
- mean: 10.01 tokens
- max: 33 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-mtop_intent-avs_triplets
- Dataset: mteb-mtop_intent-avs_triplets
- Size: 15,715 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 10.22 tokens
- max: 35 tokens
- min: 4 tokens
- mean: 9.74 tokens
- max: 27 tokens
- min: 3 tokens
- mean: 10.43 tokens
- max: 28 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-toxic_conversations_50k-avs_triplets
- Dataset: mteb-toxic_conversations_50k-avs_triplets
- Size: 49,677 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 67.17 tokens
- max: 256 tokens
- min: 3 tokens
- mean: 88.29 tokens
- max: 256 tokens
- min: 3 tokens
- mean: 64.96 tokens
- max: 252 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
mteb-tweet_sentiment_extraction-avs_triplets
- Dataset: mteb-tweet_sentiment_extraction-avs_triplets
- Size: 27,373 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 3 tokens
- mean: 20.58 tokens
- max: 45 tokens
- min: 2 tokens
- mean: 20.26 tokens
- max: 56 tokens
- min: 3 tokens
- mean: 21.1 tokens
- max: 59 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
covid-bing-query-gpt4-avs_triplets
- Dataset: covid-bing-query-gpt4-avs_triplets
- Size: 5,070 training samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 6 tokens
- mean: 15.28 tokens
- max: 33 tokens
- min: 14 tokens
- mean: 37.6 tokens
- max: 92 tokens
- min: 16 tokens
- mean: 38.13 tokens
- max: 239 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
Evaluation Dataset
Unnamed Dataset
- Size: 18,269 evaluation samples
- Columns:
anchor
,positive
, andnegative
- Approximate statistics based on the first 1000 samples:
anchor positive negative type string string string details - min: 4 tokens
- mean: 16.04 tokens
- max: 55 tokens
- min: 5 tokens
- mean: 142.75 tokens
- max: 256 tokens
- min: 5 tokens
- mean: 144.56 tokens
- max: 256 tokens
- Samples:
- Loss:
MultipleNegativesRankingLoss
with these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim" }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy
: stepsper_device_train_batch_size
: 512per_device_eval_batch_size
: 512learning_rate
: 2e-05num_train_epochs
: 10warmup_ratio
: 0.1fp16
: Truegradient_checkpointing
: Truebatch_sampler
: no_duplicates
All Hyperparameters
Click to expand
overwrite_output_dir
: Falsedo_predict
: Falseeval_strategy
: stepsprediction_loss_only
: Trueper_device_train_batch_size
: 512per_device_eval_batch_size
: 512per_gpu_train_batch_size
: Noneper_gpu_eval_batch_size
: Nonegradient_accumulation_steps
: 1eval_accumulation_steps
: Nonelearning_rate
: 2e-05weight_decay
: 0.0adam_beta1
: 0.9adam_beta2
: 0.999adam_epsilon
: 1e-08max_grad_norm
: 1.0num_train_epochs
: 10max_steps
: -1lr_scheduler_type
: linearlr_scheduler_kwargs
: {}warmup_ratio
: 0.1warmup_steps
: 0log_level
: passivelog_level_replica
: warninglog_on_each_node
: Truelogging_nan_inf_filter
: Truesave_safetensors
: Truesave_on_each_node
: Falsesave_only_model
: Falserestore_callback_states_from_checkpoint
: Falseno_cuda
: Falseuse_cpu
: Falseuse_mps_device
: Falseseed
: 42data_seed
: Nonejit_mode_eval
: Falseuse_ipex
: Falsebf16
: Falsefp16
: Truefp16_opt_level
: O1half_precision_backend
: autobf16_full_eval
: Falsefp16_full_eval
: Falsetf32
: Nonelocal_rank
: 0ddp_backend
: Nonetpu_num_cores
: Nonetpu_metrics_debug
: Falsedebug
: []dataloader_drop_last
: Falsedataloader_num_workers
: 0dataloader_prefetch_factor
: Nonepast_index
: -1disable_tqdm
: Falseremove_unused_columns
: Truelabel_names
: Noneload_best_model_at_end
: Falseignore_data_skip
: Falsefsdp
: []fsdp_min_num_params
: 0fsdp_config
: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap
: Noneaccelerator_config
: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed
: Nonelabel_smoothing_factor
: 0.0optim
: adamw_torchoptim_args
: Noneadafactor
: Falsegroup_by_length
: Falselength_column_name
: lengthddp_find_unused_parameters
: Noneddp_bucket_cap_mb
: Noneddp_broadcast_buffers
: Falsedataloader_pin_memory
: Truedataloader_persistent_workers
: Falseskip_memory_metrics
: Trueuse_legacy_prediction_loop
: Falsepush_to_hub
: Falseresume_from_checkpoint
: Nonehub_model_id
: Nonehub_strategy
: every_savehub_private_repo
: Falsehub_always_push
: Falsegradient_checkpointing
: Truegradient_checkpointing_kwargs
: Noneinclude_inputs_for_metrics
: Falseeval_do_concat_batches
: Truefp16_backend
: autopush_to_hub_model_id
: Nonepush_to_hub_organization
: Nonemp_parameters
:auto_find_batch_size
: Falsefull_determinism
: Falsetorchdynamo
: Noneray_scope
: lastddp_timeout
: 1800torch_compile
: Falsetorch_compile_backend
: Nonetorch_compile_mode
: Nonedispatch_batches
: Nonesplit_batches
: Noneinclude_tokens_per_second
: Falseinclude_num_input_tokens_seen
: Falseneftune_noise_alpha
: Noneoptim_target_modules
: Nonebatch_eval_metrics
: Falseeval_on_start
: Falsebatch_sampler
: no_duplicatesmulti_dataset_batch_sampler
: proportional
Training Logs
Epoch | Step | Training Loss | loss | medi-mteb-dev_max_accuracy |
---|---|---|---|---|
0 | 0 | - | - | 0.8705 |
0.1308 | 500 | 2.1744 | 1.5723 | 0.8786 |
0.2616 | 1000 | 1.9245 | 1.5045 | 0.8851 |
0.3925 | 1500 | 1.9833 | 1.4719 | 0.8882 |
0.5233 | 2000 | 1.7492 | 1.4434 | 0.8909 |
0.6541 | 2500 | 1.8815 | 1.4244 | 0.8935 |
0.7849 | 3000 | 1.7921 | 1.4064 | 0.8949 |
0.9158 | 3500 | 1.8495 | 1.3894 | 0.8956 |
1.0466 | 4000 | 1.7415 | 1.3744 | 0.8966 |
1.1774 | 4500 | 1.8663 | 1.3619 | 0.9005 |
1.3082 | 5000 | 1.7016 | 1.3520 | 0.8979 |
1.4390 | 5500 | 1.7308 | 1.3467 | 0.9007 |
1.5699 | 6000 | 1.6965 | 1.3346 | 0.9021 |
1.7007 | 6500 | 1.7355 | 1.3251 | 0.9018 |
1.8315 | 7000 | 1.6783 | 1.3156 | 0.9031 |
1.9623 | 7500 | 1.6381 | 1.3101 | 0.9047 |
2.0931 | 8000 | 1.7169 | 1.3056 | 0.9044 |
2.2240 | 8500 | 1.6527 | 1.3070 | 0.9039 |
2.3548 | 9000 | 1.7078 | 1.2977 | 0.9055 |
2.4856 | 9500 | 1.533 | 1.2991 | 0.9050 |
2.6164 | 10000 | 1.6676 | 1.2916 | 0.9057 |
2.7473 | 10500 | 1.5866 | 1.2885 | 0.9053 |
2.8781 | 11000 | 1.641 | 1.2765 | 0.9066 |
3.0089 | 11500 | 1.5193 | 1.2816 | 0.9062 |
3.1397 | 12000 | 1.6907 | 1.2804 | 0.9065 |
3.2705 | 12500 | 1.557 | 1.2684 | 0.9065 |
3.4014 | 13000 | 1.6808 | 1.2711 | 0.9075 |
3.5322 | 13500 | 1.4751 | 1.2700 | 0.9072 |
3.6630 | 14000 | 1.5934 | 1.2692 | 0.9081 |
3.7938 | 14500 | 1.5395 | 1.2672 | 0.9087 |
3.9246 | 15000 | 1.5809 | 1.2678 | 0.9072 |
4.0555 | 15500 | 1.4972 | 1.2621 | 0.9089 |
4.1863 | 16000 | 1.614 | 1.2690 | 0.9070 |
4.3171 | 16500 | 1.5186 | 1.2625 | 0.9091 |
4.4479 | 17000 | 1.5239 | 1.2629 | 0.9079 |
4.5788 | 17500 | 1.5354 | 1.2569 | 0.9086 |
4.7096 | 18000 | 1.5134 | 1.2559 | 0.9095 |
4.8404 | 18500 | 1.5237 | 1.2494 | 0.9100 |
4.9712 | 19000 | 1.5038 | 1.2486 | 0.9113 |
5.1020 | 19500 | 1.5527 | 1.2493 | 0.9098 |
5.2329 | 20000 | 1.5018 | 1.2521 | 0.9102 |
5.3637 | 20500 | 1.584 | 1.2496 | 0.9095 |
5.4945 | 21000 | 1.3948 | 1.2467 | 0.9102 |
5.6253 | 21500 | 1.5118 | 1.2487 | 0.9098 |
5.7561 | 22000 | 1.458 | 1.2471 | 0.9098 |
5.8870 | 22500 | 1.5158 | 1.2367 | 0.9105 |
6.0178 | 23000 | 1.4091 | 1.2480 | 0.9096 |
6.1486 | 23500 | 1.5823 | 1.2456 | 0.9114 |
6.2794 | 24000 | 1.4383 | 1.2404 | 0.9101 |
6.4103 | 24500 | 1.5606 | 1.2431 | 0.9100 |
6.5411 | 25000 | 1.3906 | 1.2386 | 0.9112 |
6.6719 | 25500 | 1.4887 | 1.2382 | 0.9103 |
6.8027 | 26000 | 1.4347 | 1.2384 | 0.9112 |
6.9335 | 26500 | 1.4733 | 1.2395 | 0.9113 |
7.0644 | 27000 | 1.4323 | 1.2385 | 0.9111 |
7.1952 | 27500 | 1.505 | 1.2413 | 0.9107 |
7.3260 | 28000 | 1.4648 | 1.2362 | 0.9114 |
7.4568 | 28500 | 1.4252 | 1.2361 | 0.9116 |
7.5877 | 29000 | 1.458 | 1.2344 | 0.9118 |
7.7185 | 29500 | 1.4309 | 1.2357 | 0.9120 |
7.8493 | 30000 | 1.4431 | 1.2330 | 0.9114 |
7.9801 | 30500 | 1.4266 | 1.2306 | 0.9127 |
8.1109 | 31000 | 1.4803 | 1.2328 | 0.9118 |
8.2418 | 31500 | 1.414 | 1.2345 | 0.9110 |
8.3726 | 32000 | 1.5456 | 1.2343 | 0.9116 |
8.5034 | 32500 | 1.346 | 1.2324 | 0.9118 |
8.6342 | 33000 | 1.4467 | 1.2315 | 0.9118 |
8.7650 | 33500 | 1.3864 | 1.2330 | 0.9119 |
8.8959 | 34000 | 1.4806 | 1.2277 | 0.9119 |
9.0267 | 34500 | 1.3381 | 1.2330 | 0.9119 |
9.1575 | 35000 | 1.5277 | 1.2315 | 0.9121 |
9.2883 | 35500 | 1.3966 | 1.2309 | 0.9112 |
9.4192 | 36000 | 1.4921 | 1.2321 | 0.9117 |
9.5500 | 36500 | 1.3668 | 1.2303 | 0.9118 |
9.6808 | 37000 | 1.4407 | 1.2308 | 0.9121 |
9.8116 | 37500 | 1.3852 | 1.2314 | 0.9118 |
9.9424 | 38000 | 1.4329 | 1.2300 | 0.9120 |
Framework Versions
- Python: 3.10.10
- Sentence Transformers: 3.1.0.dev0
- Transformers: 4.42.4
- PyTorch: 2.3.1+cu121
- Accelerate: 0.32.1
- Datasets: 2.20.0
- Tokenizers: 0.19.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
- Downloads last month
- 29
Model tree for avsolatorio/all-MiniLM-L6-v2-MEDI-MTEB-triplet-final
Base model
sentence-transformers/all-MiniLM-L6-v2Evaluation results
- Cosine Accuracy on medi mteb devself-reported0.912
- Dot Accuracy on medi mteb devself-reported0.081
- Manhattan Accuracy on medi mteb devself-reported0.912
- Euclidean Accuracy on medi mteb devself-reported0.911
- Max Accuracy on medi mteb devself-reported0.912