
Xuenan Xu

wsntxxn

https://wsntxxn.github.io

AI & ML interests

Text-to-Speech Synthesis, Text-to-Music Synthesis, Singing Voice Synthesis

Recent Activity

new activity 2 days ago

wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding: Training Repository & AudioCaps2.0

updated a Space 19 days ago

wsntxxn/MM-StoryAgent

new activity 19 days ago

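wsntxxn/MM-StoryAgent: Hello, I tested twice. Each time the images appeared, then an image error was shown immediately, and I could no longer view or download the images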


Organizations

None yet

wsntxxn's activity

New activity in wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding 2 days ago

Training Repository & AudioCaps2.0

#1 opened 3 days ago by

jocoyo

updated a Space 19 days ago

MM StoryAgent

📘

Generate a storytelling video from a topic and scene

New activity in wsntxxn/MM-StoryAgent 19 days ago

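Hello, I tested twice. Each time the images appeared, then an image error was shown immediately, and I could no longer view or download the images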

#1 opened 25 days ago by

mingweihehe

Generation succeeds, but then gives error before I preview the images

#2 opened 21 days ago by

Ihatenamesforever

New activity in wsntxxn/cnn8rnn-audioset-sed about 1 month ago

Request of the Training Repository

#2 opened about 1 month ago by

jocoyo

New activity in wsntxxn/cnn8rnn-audioset-sed 3 months ago

Adding `safetensors` variant of this model

#1 opened 5 months ago by

SFconvertbot

New activity in wsntxxn/cnn14rnn-tempgru-audiocaps-captioning 3 months ago

Adding `safetensors` variant of this model

#1 opened 3 months ago by

SFconvertbot

New activity in wsntxxn/effb2-trm-audiocaps-captioning 4 months ago

Adding `safetensors` variant of this model

#1 opened 4 months ago by

SFconvertbot

New activity in wsntxxn/effb2-trm-clotho-captioning 4 months ago

Adding `safetensors` variant of this model

#1 opened 4 months ago by

SFconvertbot

authored 5 papers 8 months ago

updated 5 models 8 months ago

wsntxxn/cnn14rnn-tempgru-audiocaps-captioning

Feature Extraction • Updated Dec 27, 2024 • 29 • 1

wsntxxn/effb2-trm-audiocaps-captioning

Feature Extraction • Updated Dec 20, 2024 • 67 • 1

wsntxxn/effb2-trm-clotho-captioning

Feature Extraction • Updated Dec 17, 2024 • 82 • 1

wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding

Audio Classification • Updated Aug 19, 2024 • 310 • 2

wsntxxn/cnn8rnn-audioset-sed

Audio Classification • Updated Dec 30, 2024 • 345 • 3

authored a paper 9 months ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19, 2024 • 5
