Fine-tuning Llama 2 70B using PyTorch FSDP • By smangrul and 3 others • Sep 13, 2023 • 22
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA • By ybelkada and 4 others • May 24, 2023 • 130
Introducing RWKV: An RNN with the advantages of a transformer • By BlinkDL and 3 others • May 15, 2023 • 17
How 🤗 Accelerate runs very large models thanks to PyTorch • By sgugger • Sep 27, 2022 • 11
Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate • By stas and 1 other • Sep 16, 2022 • 1
Accelerate Large Model Training using DeepSpeed • By smangrul and 1 other • Jun 28, 2022 • 5
Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel • By smangrul and 1 other • May 2, 2022 • 4