Mel spectogram conversion

by sahilshah - opened Nov 13, 2024

Nov 13, 2024

It seems to call the model I need to convert the audio into a mel spectogram. In python I have done that with whisper's API however here I am wondering how to go about it? Wondering if there are Apple APIs or reference implementations that are well tested or do I need to write one from scratch?

lithium0003

Owner Nov 13, 2024

If you use model from swift, vDSP.FFT on Accelerate can convert fft and convert raw fft to mel by table.
https://developer.apple.com/documentation/accelerate/vdsp/fft

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment