license: mit

This is a UTMOS model, which predicts a mean opinion score (MOS) for speech audio, traced into a TorchScript (`torch.jit`) module for simple use. The model and the inference code are from this space.

Usage:

```python
from huggingface_hub import hf_hub_download
import torch
import soundfile as sf

# Download the model from repo
model_path = hf_hub_download(
    repo_id="balacoon/utmos",
    filename="utmos.jit",
    repo_type="model",
    local_dir="./",
)

# load model
utmos_model = torch.jit.load(model_path).to(torch.device("cuda"))
# load audio
wav, sr = sf.read(
    "rms_arctic_a0001.wav",
    dtype="int16"
)
assert sr == 16000
# run inference
x = torch.tensor(wav).unsqueeze(0).cuda()
mos = utmos_model(x).item()
print(mos)
```
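
The usage snippet asserts a 16 kHz sample rate and passes a single waveform with a batch dimension added. For files that may be stereo or recorded at another rate, a small pre-check can make failures easier to diagnose. The sketch below is not part of this repo: `prepare_waveform` and `TARGET_SR` are hypothetical names, and it assumes the model expects mono input, which the README does not state explicitly.

```python
import numpy as np

TARGET_SR = 16000  # sample rate asserted in the usage snippet


def prepare_waveform(wav: np.ndarray, sr: int) -> np.ndarray:
    """Return a 1-D int16 array (hypothetical helper, not part of the repo).

    Assumes the model wants mono audio at 16 kHz. Stereo input is
    downmixed by averaging channels; other sample rates are rejected
    rather than resampled.
    """
    if sr != TARGET_SR:
        raise ValueError(
            f"expected {TARGET_SR} Hz audio, got {sr} Hz; resample first"
        )
    if wav.ndim == 2:
        # soundfile returns (frames, channels) for multi-channel files.
        # Average in a wider dtype so the int16 sum cannot overflow.
        wav = np.round(wav.astype(np.int32).mean(axis=1))
    elif wav.ndim != 1:
        raise ValueError(f"unexpected waveform shape {wav.shape}")
    return wav.astype(np.int16)
```

With this helper, the inference line would become `x = torch.tensor(prepare_waveform(wav, sr)).unsqueeze(0).cuda()`.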