Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 175
How FaR Are Large Language Models From Agents with Theory-of-Mind? Paper • 2310.03051 • Published Oct 4, 2023 • 35
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 138
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 129
Open-Bezoar Collection Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data • 7 items • Updated Apr 19, 2024 • 6