view article Article The NLP Course is becoming the LLM Course! By burtenshaw and 9 others • 3 days ago • 46
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • 12 days ago • 17
view article Article Open R1: How to use OlympicCoder locally for coding? By burtenshaw and 4 others • 17 days ago • 56
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 835
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others • Oct 29, 2024 • 55
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others • Oct 8, 2024 • 46
view article Article Llama can now see and run on your device - welcome Llama 3.2 By merve and 6 others • Sep 25, 2024 • 187
view article Article How NuminaMath Won the 1st AIMO Progress Prize By yfleureau and 7 others • Jul 11, 2024 • 118
view article Article Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others • Jun 27, 2024 • 129
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 53
view article Article Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face By lewtun and 6 others • Dec 11, 2023 • 12
view article Article SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit By ronenlap and 5 others • Dec 6, 2023 • 9
view article Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 22
view article Article Code Llama: Llama 2 learns to code By philschmid and 7 others • Aug 25, 2023 • 9