Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
CD
community
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
TianyuZhang
authored
a paper
1 day ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
sheryc
authored
a paper
1 day ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
sheryc
authored
a paper
about 2 months ago
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
View all activity
Team members
3
models
None public yet
datasets
199
Sort: Recently updated
CLAPv2/audioset_t5_debiased
Updated
Nov 11, 2024
•
10
CLAPv2/audiocaps
Updated
Nov 11, 2024
•
8
CLAPv2/audioset_strong
Viewer
•
Updated
Nov 11, 2024
•
512
•
49
CLAPv2/MUSDB18-HQ
Viewer
•
Updated
Nov 11, 2024
•
564
•
14
CLAPv2/fma_full_16bit
Updated
Nov 3, 2024
•
388
CLAPv2/sonniss_game_effects
Viewer
•
Updated
Oct 31, 2024
•
4.98k
•
10
CLAPv2/esc50
Viewer
•
Updated
Oct 30, 2024
•
2k
•
10
CLAPv2/audiostock
Viewer
•
Updated
Oct 30, 2024
•
9.9k
•
85
CLAPv2/Europarl-st
Viewer
•
Updated
Oct 26, 2024
•
153k
•
13
CLAPv2/common_voice
Viewer
•
Updated
Oct 26, 2024
•
1.18M
•
26
Expand 199 datasets