Video-Text-to-Text
Transformers
Safetensors
English
llava
text-generation
multimodal
Eval Results
Inference Endpoints
ZhangYuanhan's picture
Upload trainer_state.json with huggingface_hub
bcdabde verified