torch>=1.9.0
transformers>=4.10.0
numpy>=1.21.0
gradio>=3.0.0