Video-Text-to-Text
Transformers
Safetensors
English
llava
text-generation
multimodal
Eval Results
Inference Endpoints
ZhangYuanhan commited on
Commit
0cf6736
1 Parent(s): 733e147

Update config.json

Browse files
Files changed (1) hide show
  1. config.json +1 -0
config.json CHANGED
@@ -163,6 +163,7 @@
163
  "initializer_range": 0.02,
164
  "intermediate_size": 18944,
165
  "max_position_embeddings": 32768,
 
166
  "max_window_layers": 28,
167
  "mm_hidden_size": 1152,
168
  "mm_newline_position": "grid",
 
163
  "initializer_range": 0.02,
164
  "intermediate_size": 18944,
165
  "max_position_embeddings": 32768,
166
+ "image_token_index": 151646,
167
  "max_window_layers": 28,
168
  "mm_hidden_size": 1152,
169
  "mm_newline_position": "grid",