PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published Dec 30, 2024 • 19
Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 71
Article Making Browser-Based Inference Actually Usable By wizenheimer • 13 days ago • 10
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated 10 days ago • 109
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Paper • 2410.06885 • Published Oct 9, 2024 • 45
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 60
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 21 days ago • 49
GeoPixel Collection Pixel Grounding Large Multimodal Model in Remote Sensing • 5 items • Updated 16 days ago • 1
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 13 items • Updated 13 days ago • 8
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 25 days ago • 30
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated 22 days ago • 11
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 24 days ago • 43