Ovis2-16B
Generate speech from text with customizable voices
Generate text based on images and prompts
Demo of GOT-OCR 2.0's Transformers implementation
Gradio demo for https://github.com/jixiaozhong/Sonic
Co-Speech Gesture Video Generation