Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sugatoray
's Collections
LLMs
LLM Tools
AV LLMs
LLM Training Datasets
Papers
Leaderboards 🔥
Papers-MoE
Papers-LLMEval
LLM LLAMA3
Papers-Fundamentals
TFM: TimeSeries Foundation Models
Papers-Benchmarks
LLMs-EmbeddingModels
LLMs + Mamba
LLM + Datasets : Finance
AV LLMs
updated
1 day ago
A collection of Audio, Video and Visual LLMs.
Upvote
2
myshell-ai/OpenVoice
Text-to-Speech
•
Updated
Apr 24
•
390
Running
962
🤗
OpenVoice
dataautogpt3/ProteusV0.3
Text-to-Image
•
Updated
Feb 12
•
44.7k
•
92
ByteDance/SDXL-Lightning
Text-to-Image
•
Updated
Apr 3
•
78k
•
1.9k
openai/whisper-large-v3
Automatic Speech Recognition
•
Updated
Aug 12
•
3.89M
•
•
3.55k
stabilityai/TripoSR
Image-to-3D
•
Updated
Aug 9
•
25.7k
•
450
Efficient-Large-Model/VILA-7b
Text Generation
•
Updated
Mar 4
•
400
•
25
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19
•
120k
•
107
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20
•
214k
•
899
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jul 31
•
20.4k
•
901
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
•
2406.09246
•
Published
Jun 13
•
36
aiola/whisper-medusa-v1
Updated
Aug 3
•
164
•
174
merve/idefics3llama-vqav2
Updated
about 1 month ago
•
8
black-forest-labs/FLUX.1-schnell
Text-to-Image
•
Updated
Aug 16
•
1.05M
•
•
2.55k
Running
on
Zero
99
😻
Llama3.1 S V0.2 Checkpoint 2024 08 20
gpt-omni/mini-omni
Text-to-Speech
•
Updated
Sep 4
•
4
•
382
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
18 days ago
•
7.7k
•
379
Running
on
Zero
145
📲🫴🏻👁
Tonic's GOT OCR
GOT - OCR (from : UCAS, Beijing)
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
24 days ago
•
202k
•
1.01k
apple/coreml-sam2-large
Mask Generation
•
Updated
28 days ago
•
197
•
18
coreml-projects/sam-2-studio
Updated
10 days ago
•
14
mistralai/Pixtral-12B-2409
Updated
10 days ago
•
9
•
377
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
1 day ago
•
3.92k
•
224
openai/whisper-large-v3-turbo
Automatic Speech Recognition
•
Updated
7 days ago
•
122k
•
•
889
Revai/reverb-asr
Automatic Speech Recognition
•
Updated
3 days ago
•
79
•
60
Running
on
Zero
236
💬
GOT Online
facebook/vfusion3d
Image-to-3D
•
Updated
Aug 13
•
410
•
41
facebook/cotracker
Updated
16 days ago
•
8.14k
•
26
rhymes-ai/Aria
Text Generation
•
Updated
2 days ago
•
2.07k
•
272
Upvote
2
Share collection
View history
Collection guide
Browse collections