zephyr story
Sources mentioned by hf.co/thomwolf tweet: x.com/Thom_Wolf/status/1720503998518640703
HuggingFaceH4/zephyr-7b-beta • Text Generation • Updated Oct 16, 2024 • 226k • 1.64k
mistralai/Mistral-7B-v0.1 • Text Generation • Updated Jul 24, 2024 • 4.37M • 3.52k
stingning/ultrachat • Viewer • Updated Feb 22, 2024 • 774k • 1.21k • 429
openbmb/UltraFeedback • Viewer • Updated Dec 29, 2023 • 64k • 1.32k • 345
A little guide to building Large Language Models in 2024
Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757
Yi: Open Foundation Models by 01.AI • Paper • 2403.04652 • Published Mar 7, 2024 • 62
A Survey on Data Selection for Language Models • Paper • 2402.16827 • Published Feb 26, 2024 • 4
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research • Paper • 2402.00159 • Published Jan 31, 2024 • 61
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only • Paper • 2306.01116 • Published Jun 1, 2023 • 32