🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 8 days ago • 123
👩💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated 28 days ago • 16
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 20 days ago • 103
mrm8488/modernbert-embed-base-ft-sts-spanish-matryoshka-768-64 Sentence Similarity • Updated Jan 10 • 1.1k • 2