Léo Hunout

hunoutl

AI & ML interests

AI engineer working on the Jean Zay supercomputer in France 🇫🇷

Recent Activity

upvoted a paper 1 day ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

upvoted a paper 1 day ago

Towards Best Practices for Open Datasets for LLM Training

upvoted a paper 1 day ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

hunoutl's activity

upvoted 8 papers 1 day ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 14 days ago • 295

Humanity's Last Exam

Paper • 2501.14249 • Published 12 days ago • 54

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 6 days ago • 25

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 5 days ago • 17

s1: Simple test-time scaling

Paper • 2501.19393 • Published 5 days ago • 77

upvoted an article 2 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 9 days ago • 243

upvoted 11 papers 14 days ago

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 21

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 41

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

DeMo: Decoupled Momentum Optimization

Paper • 2411.19870 • Published Nov 29, 2024 • 6

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 17

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 27

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 36

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 44

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 74

ResearchTown: Simulator of Human Research Community

Paper • 2412.17767 • Published Dec 23, 2024 • 14

Large Action Models: From Inception to Implementation

Paper • 2412.10047 • Published Dec 13, 2024 • 33