Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper β’ 2503.01743 β’ Published 9 days ago β’ 72
Token-Efficient Long Video Understanding for Multimodal LLMs Paper β’ 2503.04130 β’ Published 7 days ago β’ 75
RuCCoD: Towards Automated ICD Coding in Russian Paper β’ 2502.21263 β’ Published 12 days ago β’ 120
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper β’ 2503.03601 β’ Published 8 days ago β’ 199
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published Feb 4 β’ 203
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper β’ 2502.08946 β’ Published 28 days ago β’ 184
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper β’ 2502.01061 β’ Published Feb 3 β’ 188
Training Language Models to Self-Correct via Reinforcement Learning Paper β’ 2409.12917 β’ Published Sep 19, 2024 β’ 138
Addition is All You Need for Energy-efficient Language Models Paper β’ 2410.00907 β’ Published Oct 1, 2024 β’ 146
CLEAR: Character Unlearning in Textual and Visual Modalities Paper β’ 2410.18057 β’ Published Oct 23, 2024 β’ 203
ROICtrl: Boosting Instance Control for Visual Generation Paper β’ 2411.17949 β’ Published Nov 27, 2024 β’ 83
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published Nov 7, 2024 β’ 116
view post Post 2086 LeRobot goes to driving school! πππ Hugging Face just announced a new collab with Yaak to bring the largest open-source self-driving dataset to LeRobot! Major kudos to HF's @cadene , as well as @sandhawalia , @Shnissen and the Yaak team!Check out the blog post here: https://huggingface.co/blog/lerobot-goes-to-driving-school See translation 1 reply Β· π 10 10 π₯ 7 7 β€οΈ 5 5 + Reply