AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 69
Whisper Collection OpenAI Whisper speech recognition models in MLX format • 48 items • Updated Oct 1, 2024 • 23
Text-To-Speech datasets Collection Some of my favorite TTS datasets, in English or in many other languages ! • 14 items • Updated Oct 3, 2024 • 4