LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 26
It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Paper • 2502.03793 • Published Feb 6 • 4
AraModernBERT Models Collection AraModernBert is an advanced Arabic language model built on the ModernBERT architecture. • 2 items • Updated 28 days ago • 3
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Paper • 2501.11175 • Published Jan 19 • 3
EASY: Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients Paper • 2201.09699 • Published Jan 24, 2022 • 2
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 251
Arabic (MSA) Summarization Models & Datasets Collection A collection of models (and the dataset used to train them) that are trained for summarizing arabic text. • 5 items • Updated Feb 20 • 1
Translation Models & Datasets Collection English to Moroccan darija (ary) models • 16 items • Updated 27 days ago • 1
Moroccan Darija Datasets Collection A collection of all available datasets for pretraining LLMs • 12 items • Updated Feb 20 • 1
Moroccan Darija Embeddings Models & Datasets Collection Sentence and word embedding models for Moroccan darija (ary) • 8 items • Updated Mar 2 • 1
Moroccan Darija LLMs Collection Language Models that speaks Moroccan darija (ary) • 9 items • Updated Feb 20 • 1
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 170