Pillow opencv-python num2words ffmpeg-python transformers @ git+https://github.com/huggingface/transformers.git@refs/pull/36126/head accelerate>=0.26.0 decord==0.6.0