RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 53
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30, 2024 • 42
DMLR: Data-centric Machine Learning Research -- Past, Present and Future Paper • 2311.13028 • Published Nov 21, 2023 • 1
DeltaZip: Multi-Tenant Language Model Serving via Delta Compression Paper • 2312.05215 • Published Dec 8, 2023 • 1
eth-easl/pythia_2.8b_deduped-task1336_peixian_equity_evaluation_corpus_gender_classifier Updated Sep 1, 2023 • 1
eth-easl/pythia_2.8b_deduped-task065_timetravel_consistent_sentence_classification Text Generation • Updated Sep 1, 2023 • 8