AlexHung29629
/

test_mllama_v12

Feature Extraction

Model card Files Files and versions Community

AlexHung29629 commited on Nov 19

Commit

d3c0181

•

1 Parent(s): 161d998

Update ultravox_processing.py

Files changed (1) hide show

ultravox_processing.py +1 -1

ultravox_processing.py CHANGED Viewed

@@ -140,7 +140,7 @@ class UltravoxProcessor(transformers.ProcessorMixin):
                 assert sampling_rate is not None, "Sampling rate must be provided."
                 audio_len = 30 * sampling_rate
             else:
-                audio_len = audio.shape[-1]
             # It's guaranteed that the number of frames is less than or equal to this amount.
             # For Whisper this is exact AFAICT, but for Wav2Vec2 it's an upper bound.
             # Currently, StackAudioFrames makes sure an over-estimation won't cause issues by padding the audio embeddings.

                 assert sampling_rate is not None, "Sampling rate must be provided."
                 audio_len = 30 * sampling_rate
             else:
+                audio_len = max([a.shape[-1] for a in audio])
             # It's guaranteed that the number of frames is less than or equal to this amount.
             # For Whisper this is exact AFAICT, but for Wav2Vec2 it's an upper bound.
             # Currently, StackAudioFrames makes sure an over-estimation won't cause issues by padding the audio embeddings.