|
--- |
|
license: openrail |
|
datasets: |
|
- Fhrozen/AudioSet2K22 |
|
- Chr0my/Epidemic_sounds |
|
- ChristophSchuhmann/lyrics-index |
|
- Cropinky/rap_lyrics_english |
|
- tsterbak/eurovision-lyrics-1956-2023 |
|
- brunokreiner/genius-lyrics |
|
- google/MusicCaps |
|
- ccmusic-database/music_genre |
|
- Hyeon2/riffusion-musiccaps-dataset |
|
- SamAct/autotrain-data-musicprompt |
|
- Chr0my/Epidemic_music |
|
- juliensimon/autonlp-data-song-lyrics |
|
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC |
|
- Chr0my/freesound.org |
|
- teticio/audio-diffusion-256 |
|
- KELONMYOSA/dusha_emotion_audio |
|
- Ar4ikov/iemocap_audio_text_splitted |
|
- flexthink/ljspeech |
|
- mozilla-foundation/common_voice_13_0 |
|
- facebook/voxpopuli |
|
- SocialGrep/one-million-reddit-jokes |
|
- breadlicker45/human-midi-rlhf |
|
- breadlicker45/midi-gpt-music-small |
|
- projectlosangeles/Los-Angeles-MIDI-Dataset |
|
- huggingartists/epic-rap-battles-of-history |
|
- SocialGrep/one-million-reddit-confessions |
|
- shahules786/prosocial-nsfw-reddit |
|
- Thewillonline/reddit-sarcasm |
|
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606 |
|
- lmsys/chatbot_arena_conversations |
|
- mozilla-foundation/common_voice_11_0 |
|
- mozilla-foundation/common_voice_4_0 |
|
- dell-research-harvard/AmericanStories |
|
- zZWipeoutZz/insane_style |
|
- mu-llama/MusicQA |
|
- RaphaelOlivier/whisper_adversarial_examples |
|
- huggingartists/metallica |
|
- vldsavelyev/guitar_tab |
|
- NLPCoreTeam/humaneval_ru |
|
- seungheondoh/audioset-music |
|
- gary109/onset-singing3_corpora_parliament_processed_MIR-ST500 |
|
- LDD5522/Rock_Vocals |
|
- huggingartists/rage-against-the-machine |
|
- huggingartists/chester-bennington |
|
- huggingartists/logic |
|
- cmsolson75/artist_song_lyric_dataset |
|
- BhavyaMuni/artist-lyrics |
|
- vjain/emotional_intelligence |
|
- mhenrichsen/context-aware-splits |
|
metrics: |
|
- accuracy |
|
- bertscore |
|
- bleu |
|
- bleurt |
|
- brier_score |
|
- character |
|
- chrf |
|
language: |
|
- en |
|
- es |
|
- it |
|
- pt |
|
- la |
|
- fr |
|
- ru |
|
- zh |
|
- ja |
|
- el |
|
library_name: transformers |
|
tags: |
|
- music |
|
pipeline_tag: text-to-speech |
|
--- |
|
# SoundSlayerAI |
|
|
|
SoundSlayerAI is an innovative project that focuses on music-related tasks This project aims to provide various functionalities for audio analysis and processing, making it easier to work with music datasets. |
|
|
|
## Datasets |
|
|
|
SoundSlayerAI makes use of the following datasets: |
|
|
|
- Fhrozen/AudioSet2K22 |
|
- Chr0my/Epidemic_sounds |
|
- ChristophSchuhmann/lyrics-index |
|
- Cropinky/rap_lyrics_english |
|
- tsterbak/eurovision-lyrics-1956-2023 |
|
- brunokreiner/genius-lyrics |
|
- google/MusicCaps |
|
- ccmusic-database/music_genre |
|
- Hyeon2/riffusion-musiccaps-dataset |
|
- SamAct/autotrain-data-musicprompt |
|
- Chr0my/Epidemic_music |
|
- juliensimon/autonlp-data-song-lyrics |
|
- Datatang/North_American_English_Speech_Data_by_Mobile_Phone_and_PC |
|
- Chr0my/freesound.org |
|
- teticio/audio-diffusion-256 |
|
- KELONMYOSA/dusha_emotion_audio |
|
- Ar4ikov/iemocap_audio_text_splitted |
|
- flexthink/ljspeech |
|
- mozilla-foundation/common_voice_13_0 |
|
- facebook/voxpopuli |
|
- SocialGrep/one-million-reddit-jokes |
|
- breadlicker45/human-midi-rlhf |
|
- breadlicker45/midi-gpt-music-small |
|
- projectlosangeles/Los-Angeles-MIDI-Dataset |
|
- huggingartists/epic-rap-battles-of-history |
|
- SocialGrep/one-million-reddit-confessions |
|
- shahules786/prosocial-nsfw-reddit |
|
- Thewillonline/reddit-sarcasm |
|
- autoevaluate/autoeval-eval-futin__guess-vi-4200fb-2012366606 |
|
- lmsys/chatbot_arena_conversations |
|
- mozilla-foundation/common_voice_11_0 |
|
- mozilla-foundation/common_voice_4_0 |
|
|
|
## Library |
|
|
|
The core library used in this project is "pyannote-audio." This library provides a wide range of functionalities for audio analysis and processing, making it an excellent choice for working with music datasets. The "pyannote-audio" library offers a comprehensive set of tools and algorithms for tasks such as audio segmentation, speaker diarization, music transcription, and more. |
|
|
|
## Metrics |
|
|
|
To evaluate the performance of SoundSlayerAI, several metrics are employed, including: |
|
|
|
- Accuracy |
|
- Bertscore |
|
- BLEU |
|
- BLEURT |
|
- Brier Score |
|
- Character |
|
|
|
These metrics help assess the effectiveness and accuracy of the implemented algorithms and models. |
|
|
|
## Language |
|
|
|
The SoundSlayerAI project primarily focuses on the English language. The datasets and models used in this project are optimized for English audio and text analysis tasks. |
|
|
|
## Usage |
|
|
|
To use SoundSlayerAI, follow these steps: |
|
|
|
1. Install the required dependencies by running `pip install pyannote-audio`. |
|
|
|
2. Import the necessary modules from the "pyannote.audio" package to access the desired functionalities. |
|
|
|
3. Load the audio data or use the provided datasets to perform tasks such as audio segmentation, speaker diarization, music transcription, and more. |
|
|
|
4. Apply the appropriate algorithms and models from the "pyannote.audio" library to process and analyze the audio data. |
|
|
|
5. Evaluate the results using the specified metrics, such as accuracy, bertscore, BLEU, BLEURT, brier_score, and character. |
|
|
|
6. Iterate and refine your approach to achieve the desired outcomes for your music-related tasks. |
|
|
|
## License |
|
|
|
SoundSlayerAI is released under the Openrail license. Please refer to the LICENSE file for more details. |
|
|
|
## Contributions |
|
|
|
Contributions to SoundSlayerAI are welcome! If you have any ideas, bug fixes, or enhancements, feel free to submit a pull request or open an issue on the GitHub repository. |
|
|
|
## Contact |
|
|
|
For any inquiries or questions regarding SoundSlayerAI, please reach out to the project maintainer at [insert email address]. |
|
|
|
Thank you for your interest in SoundSlayerAI! |