Pillow opencv-python num2words ffmpeg-python torch transformers @ git+https://github.com/huggingface/transformers.git@refs/pull/36126/head flash-attn accelerate>=0.26.0 decord==0.6.0