FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching
Paper
ā¢
2502.11128
ā¢
Published
Machine Learning for Audio/Speech
return_timestamps=True
helps reduce hallucinations, particularly when doing long-form evaluation with Transformersā āchunkedā algorithm. The cat sat on the on the on the mat.
<|0.00|> The cat sat on the on the on the mat.<|5.02|>
<|0.00|> The cat sat on the mat.<|5.02|>