arxiv:2407.14329
Xuenan Xu
wsntxxn
AI & ML interests
Text to Speech Synthesis
Text to Music Synthesis
Singing Voice Synthesis
Recent Activity
new activity
13 days ago
wsntxxn/cnn8rnn-audioset-sed:Adding `safetensors` variant of this model
new activity
16 days ago
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning:Adding `safetensors` variant of this model
new activity
23 days ago
wsntxxn/effb2-trm-audiocaps-captioning:Adding `safetensors` variant of this model
Organizations
None yet
Papers
10
models
7
wsntxxn/cnn8rnn-audioset-sed
Audio Classification
•
Updated
•
214
•
2
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning
Feature Extraction
•
Updated
•
169
•
1
wsntxxn/effb2-trm-audiocaps-captioning
Feature Extraction
•
Updated
•
158
•
1
wsntxxn/effb2-trm-clotho-captioning
Feature Extraction
•
Updated
•
155
•
1
wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding
Audio Classification
•
Updated
•
111
•
2
wsntxxn/audiocaps-simple-tokenizer
Updated
wsntxxn/clotho-simple-tokenizer
Updated
datasets
None public yet