xyzDivergence
/

whisper-large-v2-vietnamese-lyrics-transcription

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

Update README.md

#2

by GooglyEyeSuperman - opened 25 days ago

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -44,7 +44,7 @@ To generate the transcription for a song, we can use the Transformers [`pipeline
 ```
 ## Training Data
-The training dataset consists of 7,000 Vietnamese songs, in total of roughly 550 hours of audio, across various Vietnamese music genres, dialects and accents. Due to IP concerns, the data is not publicly available. Each song includes lyrics along with corresponding line-level timestamps, enabling precise mapping of audio segments to their respective lyrics based on the provided timestamp information.
 Technical report coming soon.
 This project was made through equal contributions from:

 ```
 ## Training Data
+The training dataset consists of 7,000 Vietnamese songs, in total of roughly 550 hours of audio, across various Vietnamese music genres, dialects and accents. Due to copyright concerns, the raw data is not publicly available. However, the CSV files, which contain links to the songs and lyrics, can be used for downloading and are available in [our repository](https://github.com/kelvinbksoh/Vietnamese-Automated-Lyrics-Transcription/tree/main/data_processing/in). Each song includes lyrics along with corresponding line-level timestamps, enabling precise mapping of audio segments to their respective lyrics based on the provided timestamp information.
 Technical report coming soon.
 This project was made through equal contributions from: