multimodal meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • Updated Dec 4, 2024 • 592k • • 1.46k
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • Updated Dec 4, 2024 • 592k • • 1.46k
audio-collection Running on T4 28 28 Parakeet-tdt_ctc-1.1b 🦜 Generate text transcripts with timestamps from audio or video
Running on T4 28 28 Parakeet-tdt_ctc-1.1b 🦜 Generate text transcripts with timestamps from audio or video