Melodies Dataset

This directory contains the training, validation, and test datasets for melody generation.

Directory Structure

  • train/ (80% of data): Training dataset
  • val/ (10% of data): Validation dataset
  • test/ (10% of data): Test dataset
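
The README does not say how the 80/10/10 split was produced. One possible way to reproduce such a split is sketched below. The function name `split_dataset`, the fixed seed, and the choice to shuffle at the file level are assumptions made for illustration, not the procedure actually used for this dataset.

```python
import random

def split_dataset(paths, train_frac=0.8, val_frac=0.1, seed=0):
    """Shuffle file paths deterministically and split them into train/val/test.

    Hypothetical helper: the fractions mirror the 80/10/10 split described
    above, but the seed and shuffling strategy are illustrative choices.
    """
    items = sorted(paths)  # sort first so the result does not depend on input order
    random.Random(seed).shuffle(items)
    n_train = int(len(items) * train_frac)
    n_val = int(len(items) * val_frac)
    return {
        "train": items[:n_train],
        "val": items[n_train:n_train + n_val],
        "test": items[n_train + n_val:],  # the remainder goes to test
    }
```

Sorting before shuffling keeps the split stable across runs as long as the set of files and the seed stay the same.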

File Formats

  • MIDI files (.mid)
  • Each file follows the naming convention [genre]_[mood]_[key]_[tempo].mid
    Example: pop_happy_cmajor_120.mid
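
The naming convention can be parsed mechanically. The following is a minimal sketch that assumes no field contains an underscore and that the tempo is a whole number of BPM. `MelodyName` and `parse_melody_filename` are hypothetical names introduced here.

```python
from pathlib import Path
from typing import NamedTuple

class MelodyName(NamedTuple):
    """Fields encoded in a file name (hypothetical container type)."""
    genre: str
    mood: str
    key: str
    tempo: int

def parse_melody_filename(filename):
    """Parse '[genre]_[mood]_[key]_[tempo].mid' into its four fields."""
    path = Path(filename)
    parts = path.stem.split("_")
    # Assumption: exactly four underscore-separated fields and a .mid suffix.
    if path.suffix.lower() != ".mid" or len(parts) != 4:
        raise ValueError(f"unexpected file name: {filename!r}")
    genre, mood, key, tempo = parts
    if not tempo.isdigit():
        raise ValueError(f"tempo is not an integer: {tempo!r}")
    return MelodyName(genre, mood, key, int(tempo))
```

For the example name above, `parse_melody_filename("pop_happy_cmajor_120.mid")` yields genre `pop`, mood `happy`, key `cmajor`, and tempo `120`.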

Genre Categories

  • pop/
  • jazz/
  • classical/

Metadata Format

Each MIDI file should have corresponding metadata with the following fields:

  • Genre
  • Key signature
  • Tempo (BPM)
  • Mood/emotion
  • Time signature
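
The README lists the metadata fields but not how they are stored. As one possible representation, the sketch below models a record as a dataclass serialized to JSON. The class name, the field names, the JSON format, and the `"4/4"` value are all assumptions for illustration.

```python
import json
from dataclasses import dataclass, asdict

@dataclass
class MelodyMetadata:
    """One metadata record per MIDI file (hypothetical schema)."""
    genre: str
    key_signature: str
    tempo_bpm: int
    mood: str
    time_signature: str

    def to_json(self):
        """Serialize the record to a JSON string."""
        return json.dumps(asdict(self), indent=2)

# Illustrative values only; the time signature is an assumed example.
example = MelodyMetadata(
    genre="pop",
    key_signature="cmajor",
    tempo_bpm=120,
    mood="happy",
    time_signature="4/4",
)
```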

Data Sources

[To be populated with actual sources]

Usage

This dataset is used for:

  1. Training melody generation models
  2. Model evaluation
  3. Style transfer experiments

File Count

  • Train: [number] files
  • Val: [number] files
  • Test: [number] files
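
Once the data is in place, the counts above could be filled in with a short script. This sketch assumes the layout described in this README, with `.mid` files stored somewhere under each split directory. `count_midi_files` is a hypothetical helper.

```python
from pathlib import Path

def count_midi_files(root):
    """Count .mid files under each split directory of the dataset root."""
    root = Path(root)
    counts = {}
    for split in ("train", "val", "test"):
        split_dir = root / split
        # A missing split directory is reported as zero files.
        if split_dir.is_dir():
            counts[split] = sum(1 for _ in split_dir.rglob("*.mid"))
        else:
            counts[split] = 0
    return counts
```

Calling `count_midi_files("datasets/melodies")` returns a dictionary keyed by split name.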

Notes

  • All MIDI files are quantized to a 16th-note grid
  • Tempo range: 60-180 BPM
  • All melodies are monophonic
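
These three constraints can be checked automatically. The sketch below assumes the notes have already been extracted from a MIDI file as `(start_tick, end_tick)` pairs, that the beat is a quarter note, and that the tempo range is inclusive. `check_melody` is a hypothetical helper and does not read MIDI files itself.

```python
def check_melody(notes, ticks_per_beat, tempo_bpm):
    """Return a list of problems; an empty list means all checks passed.

    notes: list of (start_tick, end_tick) pairs (assumed input format).
    """
    problems = []

    # Tempo range: 60-180 BPM (treated as inclusive here).
    if not 60 <= tempo_bpm <= 180:
        problems.append(f"tempo {tempo_bpm} BPM is outside 60-180")

    # 16th-note grid: a quarter-note beat holds four 16th notes.
    step, remainder = divmod(ticks_per_beat, 4)
    if remainder or step == 0:
        problems.append("ticks_per_beat is not a positive multiple of 4")
    else:
        for start, end in notes:
            if start % step or end % step:
                problems.append(f"note ({start}, {end}) is off the 16th-note grid")

    # Monophonic: after sorting by start, no note may begin before the previous one ends.
    ordered = sorted(notes)
    for (s1, e1), (s2, e2) in zip(ordered, ordered[1:]):
        if s2 < e1:
            problems.append(f"notes ({s1}, {e1}) and ({s2}, {e2}) overlap")

    return problems
```

Comparing only neighbouring notes after sorting is enough to detect whether any overlap exists, although it may not list every overlapping pair.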

Directory Structure

datasets/melodies/
├── README.md
├── train/
│   ├── pop/
│   ├── jazz/
│   └── classical/
├── val/
│   ├── pop/
│   ├── jazz/
│   └── classical/
└── test/
    ├── pop/
    ├── jazz/
    └── classical/