view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 124
view article Article Introducing RWKV — An RNN with the advantages of a transformer May 15, 2023 • 21
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 29 days ago • 18
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 988
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published Jan 21 • 66
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published Jan 16 • 23
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 235