Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • about 14 hours ago • 38
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others • 5 days ago • 23
Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them) By FriendliAI and 2 others • about 24 hours ago • 17
Should We Still Pretrain Encoders with Masked Language Modeling? By Nicolas-BZRD and 3 others • about 11 hours ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 172
How Much Power does a SOTA Open Video Model Use? ⚡🎥 By jdelavande and 2 others • about 7 hours ago • 5
Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • about 14 hours ago • 38
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others • 5 days ago • 23
Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them) By FriendliAI and 2 others • about 24 hours ago • 17
Should We Still Pretrain Encoders with Masked Language Modeling? By Nicolas-BZRD and 3 others • about 11 hours ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 172
How Much Power does a SOTA Open Video Model Use? ⚡🎥 By jdelavande and 2 others • about 7 hours ago • 5