fx
--- a/README.md
+++ b/README.md
@@ -18,18 +18,18 @@ tags:
 
 # Arousal - Dominance - Valence
 
-Dimensional Speech Emotion Recognition model of simultaneous use of [
-Achieves `0.6760566` valence CCC on [MSP Podcast Test 1](https://paperswithcode.com/sota/speech-emotion-recognition-on-msp-podcast). Used as teacher for [
+Dimensional Speech Emotion Recognition model of simultaneous use of [wavlm](https://huggingface.co/3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes) / [wav2vec2.0](https://github.com/audeering/w2v2-how-to).
+Achieves `0.6760566` valence CCC on [MSP Podcast Test 1](https://paperswithcode.com/sota/speech-emotion-recognition-on-msp-podcast). Used as teacher for [wav2small ..]().
 
 
 
-**[PapersWithCode](https://paperswithcode.com/dataset/msp-podcast)**
+**[PapersWithCode](https://paperswithcode.com/dataset/msp-podcast) / [arXiv](https://arxiv.org/abs/2408.13920)**
 
 ```
-Wav2Small: Distilling Wav2Vec2 to 72K parameters for
-
-
-2024, arXiV Preprint
+Wav2Small: Distilling Wav2Vec2 to 72K parameters for low-resource
+speech emotion recognition.
+D. Kounadis-Bastian, O. Schrüfer, A. Derington, H. Wierstorf,
+F. Eyben, F. Burkhardt, B.W. Schuller. 2024, arXiV Preprint
 ```
 
 <table style="width:500px">
@@ -41,7 +41,7 @@ Florian Eyben, Felix Burkhardt, Björn Schuller.
 
 
 
-#
+# HowTo
 ```python
 import librosa
 import torch