view article Article MONET: Lowering the bar for World-Class Image Generation research. jasperai • about 9 hours ago • 4
MONET - Massive Open Non-redundant, Enriched, Text-to-image Collection A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated about 3 hours ago • 4
MiniCPM5 Collection A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 3 days ago • 19
view article Article Why Open Models Are the Only Sustainable Way to Teach AI penelopegittos • 6 days ago • 8
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published 10 days ago • 131
Scaling Properties of Continuous Diffusion Spoken Language Models Paper • 2604.24416 • Published Apr 27 • 1
SEC EDGAR Collection A collection of all major filings available through the SEC EDGAR database. • 11 items • Updated Apr 7 • 7