BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba Paper • 2408.02600 • Published Aug 5, 2024 • 11
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Paper • 2503.05592 • Published 6 days ago • 24
steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 32
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 60
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper • 2502.07701 • Published 30 days ago • 34
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153
view article Article Using 🤗 to Train a GPT-2 Model for Music Generation By juancopi81 • Oct 5, 2023 • 8
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162