OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published 23 days ago • 18
openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 3.92M • • 2.3k
OpenAssistant/reward-model-deberta-v3-large-v2 Text Classification • Updated Feb 1, 2023 • 13.2k • 218