Audio dataset - a datasets-examples Collection

datasets-examples 's Collections

File names and splits

Manual Configuration

Format: CSV and TSV

Audio dataset

updated Aug 2

N datasets showcase how to configure and load audio datasets

datasets-examples/doc-audio-1

Viewer • Updated Aug 2 • 4 • 68
Note If your dataset only consists of one column with audio, you can use a simple setup: store your audio files at the root.
datasets-examples/doc-audio-2

Viewer • Updated Aug 2 • 4 • 37
Note or in a subdirectory
datasets-examples/doc-audio-3

Viewer • Updated Aug 2 • 4 • 43
Note Multiple formats are supported at the same time, including AIFF, FLAC, MP3, OGG and WAV.
datasets-examples/doc-audio-4

Viewer • Updated Aug 2 • 4 • 39
Note If you have several splits, you can put your audio files into directories named accordingly. This setup currently creates an extra unwanted label column (follow the bug resolution at https://github.com/huggingface/dataset-viewer/issues/3014).
datasets-examples/doc-audio-5

Viewer • Updated Aug 2 • 4 • 51
Note If there is additional information you'd like to include about your dataset, like the transcription, add it as a `metadata.csv` file in your repository.
datasets-examples/doc-audio-6

Viewer • Updated Aug 2 • 4 • 57
Note You can also use a JSONL file `metadata.jsonl`.
datasets-examples/doc-audio-7

Viewer • Updated Aug 2 • 4 • 43
Note You can also locate the audio files in a different subdirectory of the split.
datasets-examples/doc-audio-8

Viewer • Updated Aug 2 • 4 • 43
Note For audio classification datasets, you can also use a simple setup: use directories to name the audio classes.
datasets-examples/doc-audio-9

Viewer • Updated Aug 2 • 4 • 40
Note You can also use provide multiple splits.
datasets-examples/doc-audio-10

Viewer • Updated Aug 2 • 4 • 35
Note You can disable this automatic behavior in the YAML configuration. If your directory names have no special meaning, set `drop_labels: true` in the README header
datasets-examples/doc-audio-11

Viewer • Updated Aug 2 • 4 • 44
Note The WebDataset format is well suited for large scale audio datasets. It consists of TAR archives containing audio files and their metadata and is optimized for streaming.