SIMS Collection Models and evaluation data from the paper: "Scaling Analysis of Interleaved Speech-Text Language Models" • 4 items • Updated 2 days ago • 1
Scaling Analysis of Interleaved Speech-Text Language Models Paper • 2504.02398 • Published 3 days ago • 24
RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling Paper • 2503.09601 • Published 25 days ago • 14
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published Feb 19 • 69
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide the tokeniser, LM, and datasets • 6 items • Updated Feb 25 • 13
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published Feb 13 • 34
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Paper • 2501.03059 • Published Jan 6 • 22
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units Paper • 2212.09730 • Published Dec 19, 2022 • 1