view article Article TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation By imomayiz • 3 days ago • 17
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 66
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 5 days ago • 78
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • May 15, 2024 • 14
LLMs in Cyber Security Collection Papers and datasets from the Stratosphere lab related to applications of LLMs in security. • 5 items • Updated Oct 11, 2024 • 4
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 555
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 60 items • Updated 20 minutes ago • 504
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Dec 3, 2024 • 51
Domain Specific Data Collection This is a collection of tools for building domain specific datasets using human domain expertise and synthetic data generation. • 3 items • Updated Dec 11, 2024 • 3
Arabic Aya DPO Datasets Collection Our synthetic DPO datasets for Arabic Aya. • 5 items • Updated Jun 4, 2024 • 4
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • Jun 4, 2024 • 73