Video-Text-to-Text
Transformers
Safetensors
English
llava
text-generation
multimodal
Eval Results
Inference Endpoints
ZhangYuanhan commited on
Commit
35a0ba2
1 Parent(s): 01a5f7c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -207,7 +207,7 @@ input_ids = tokenizer_image_token(prompt_question, tokenizer, IMAGE_TOKEN_INDEX,
207
  cont = model.generate(
208
  input_ids,
209
  images=video,
210
- modalities= ["video"]
211
  do_sample=False,
212
  temperature=0,
213
  max_new_tokens=4096,
 
207
  cont = model.generate(
208
  input_ids,
209
  images=video,
210
+ modalities= ["video"],
211
  do_sample=False,
212
  temperature=0,
213
  max_new_tokens=4096,