Multilingual for Translation Corpus Helsinki-NLP/opus_books Viewer • Updated Mar 29, 2024 • 1.25M • 14.8k • 91
models rasa/LaBSE Feature Extraction • Updated May 20, 2021 • 8.91k • • 23 nomic-ai/nomic-embed-text-v1.5 Sentence Similarity • 0.1B • Updated Apr 7 • 17.4M • 837 NovaSearch/stella_en_1.5B_v5 Sentence Similarity • 2B • Updated Jul 28, 2025 • 41k • 261 llmware/llama-3.2-1b-gguf 1B • Updated Feb 8, 2025 • 12 • 1
Vietnamese ngtoanrob/vien-translation Translation • Updated Feb 24, 2023 • 7 • 1 ngtoanrob/envi-translation Updated Apr 1, 2023 • 8 • 1 gozu888/Envit5-tuned Translation • 0.3B • Updated Jun 28, 2023 • 18 • 3 IWSLT/mt_eng_vietnamese Updated Jan 18, 2024 • 563 • 30
Wish list HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 73.3k • 718 bookcorpus/bookcorpus Updated May 3, 2024 • 21.8k • 354 sentence-transformers/wikipedia-en-sentences Viewer • Updated Apr 25, 2024 • 7.87M • 681 • 7 sentence-transformers/paq Viewer • Updated May 1, 2024 • 64.4M • 326 • 2
LLMs TheBloke/Llama-2-13B-chat-GGML Text Generation • Updated Sep 27, 2023 • 764 • 693 TheBloke/Llama-2-7B-32K-Instruct-GGML Updated Sep 27, 2023 • 7 • 8 openchat/openchat-3.6-8b-20240522 Text Generation • 8B • Updated May 28, 2024 • 11.5k • • 157
corpuses Skylion007/openwebtext Viewer • Updated Dec 26, 2025 • 8.01M • 64.1k • 513 humarin/chatgpt-paraphrases Viewer • Updated Apr 5, 2023 • 419k • 267 • 59 stanford-oval/ccnews Viewer • Updated Aug 31, 2024 • 893M • 6.4k • 36 stanford-oval/wikipedia Viewer • Updated Apr 29, 2025 • 345M • 1.56k • 14
Multilingual for Translation Corpus Helsinki-NLP/opus_books Viewer • Updated Mar 29, 2024 • 1.25M • 14.8k • 91
Wish list HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 73.3k • 718 bookcorpus/bookcorpus Updated May 3, 2024 • 21.8k • 354 sentence-transformers/wikipedia-en-sentences Viewer • Updated Apr 25, 2024 • 7.87M • 681 • 7 sentence-transformers/paq Viewer • Updated May 1, 2024 • 64.4M • 326 • 2
models rasa/LaBSE Feature Extraction • Updated May 20, 2021 • 8.91k • • 23 nomic-ai/nomic-embed-text-v1.5 Sentence Similarity • 0.1B • Updated Apr 7 • 17.4M • 837 NovaSearch/stella_en_1.5B_v5 Sentence Similarity • 2B • Updated Jul 28, 2025 • 41k • 261 llmware/llama-3.2-1b-gguf 1B • Updated Feb 8, 2025 • 12 • 1
LLMs TheBloke/Llama-2-13B-chat-GGML Text Generation • Updated Sep 27, 2023 • 764 • 693 TheBloke/Llama-2-7B-32K-Instruct-GGML Updated Sep 27, 2023 • 7 • 8 openchat/openchat-3.6-8b-20240522 Text Generation • 8B • Updated May 28, 2024 • 11.5k • • 157
Vietnamese ngtoanrob/vien-translation Translation • Updated Feb 24, 2023 • 7 • 1 ngtoanrob/envi-translation Updated Apr 1, 2023 • 8 • 1 gozu888/Envit5-tuned Translation • 0.3B • Updated Jun 28, 2023 • 18 • 3 IWSLT/mt_eng_vietnamese Updated Jan 18, 2024 • 563 • 30
corpuses Skylion007/openwebtext Viewer • Updated Dec 26, 2025 • 8.01M • 64.1k • 513 humarin/chatgpt-paraphrases Viewer • Updated Apr 5, 2023 • 419k • 267 • 59 stanford-oval/ccnews Viewer • Updated Aug 31, 2024 • 893M • 6.4k • 36 stanford-oval/wikipedia Viewer • Updated Apr 29, 2025 • 345M • 1.56k • 14