view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 546
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 4 days ago • 22
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Oct 9, 2024 • 14
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 20 days ago • 132
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published Dec 23, 2024 • 30
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 89
emrecan/bert-base-turkish-cased-mean-nli-stsb-tr Sentence Similarity • Updated Jan 24, 2022 • 223k • 35