Efficient Audio Captioning with Encoder-Level Knowledge Distillation
Paper
•
2407.14329
•
Published
•
4
None defined yet.
return_timestamps=True
helps reduce hallucinations, particularly when doing long-form evaluation with Transformers’ “chunked” algorithm. The cat sat on the on the on the mat.
<|0.00|> The cat sat on the on the on the mat.<|5.02|>
<|0.00|> The cat sat on the mat.<|5.02|>