Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK Nov 21, 2024 • 35
Follow The Money Collection https://docs.google.com/presentation/d/1heWC_K_vqWmK5W4Un1aK_wY-aywmjmp6di6vPAn3bns/edit?usp=sharing • 4 items • Updated 4 days ago • 1
Article Yay! Organizations can now publish blog Articles By huggingface • 5 days ago • 29
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 11 days ago • 47
Article Train 400x faster Static Embedding Models with Sentence Transformers 11 days ago • 121
Article Beyond Image Preferences - Rich Human Feedback for Text-to-Image Generation By RapidataAI • 16 days ago • 13
Article Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI • 18 days ago • 18
Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 23 days ago • 31
Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 27 days ago • 26
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 207
Synthetic Data Generator Collection A collection of tools and datasets related to no-code Synthetic Data Generation. • 19 items • Updated 5 days ago • 7
Smol but mighty Collection A collection of smol but mighty models • 10 items • Updated 4 days ago • 4
Gradio WebRTC Cookbook ⚡️ Collection Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated Dec 10, 2024 • 17
Lora Land - 27 High-Quality LoRA Adapters Collection 27 Fine-tuned LoRA Adapters using Mistral-7B. Try them here: https://predibase.com/lora-land • 27 items • Updated Apr 26, 2024 • 4
Self-Instruct: Aligning Language Model with Self Generated Instructions Paper • 2212.10560 • Published Dec 20, 2022 • 9
Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76