ali-issa/eng_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli_with_vocab Viewer • Updated 23 days ago • 142M • 172
ali-issa/new_eng_custom_tokenizer_filtered_short_sentences_less_than_5_words Updated 23 days ago • 23
ali-issa/eng_custom_tokenizer_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli Updated 23 days ago • 29
ali-issa/eng_filtered_short_sentences_less_than_5_words_training_data_for_opus_aya_xnli Viewer • Updated 24 days ago • 142M • 227
ali-issa/arb_diacritized_tokenized_filtered_dataset_with_custom_tokenizer Viewer • Updated 28 days ago • 53.9M • 102
ali-issa/arb_diacritized_tokenized_filtered_dataset_with_arb-bpe-tokenizer-32768 Viewer • Updated Mar 5 • 141M • 323
ali-issa/new_removed_none_values_arb_filtered_and_diacritized_short_sentences_less_than_5_words Viewer • Updated Feb 12 • 141M • 200