WLV3t-SG-LN-TSHLBT / README.md
lynusl's picture
Create README.md
a34642d verified
metadata
datasets:
  - aether-raid/SGdataset
metrics:
  - wer
base_model:
  - openai/whisper-large-v3-turbo
pipeline_tag: automatic-speech-recognition

Whisper Large V3 Turbo (WLV3t) trained on sgatc with

  • Loud Normalization (LN)
  • The following Augmentations (HLBT):
    • T: time stretch
    • S: seven band parametric EQ
    • H: high pass
    • L: low pass
    • B: band pass
    • T: tanh distortion