ZhangYuanhan committed
Commit 733e147
1 parent: 35a0ba2

Update config.json

Files changed (1)
  1. config.json +1 -1
config.json CHANGED
@@ -178,7 +178,7 @@
   "mm_vision_select_layer": -2,
   "mm_vision_tower": "google/siglip-so400m-patch14-384",
   "mm_vision_tower_lr": 2e-06,
-  "model_type": "qwen2",
+  "model_type": "llava",
   "num_attention_heads": 28,
   "num_hidden_layers": 28,
   "num_key_value_heads": 4,
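
The commit touches a single field, `model_type`, and leaves every other key unchanged. As a minimal sketch of how such a one-field edit could be applied and checked, the Python below works on a hypothetical excerpt of `config.json` that contains only the keys visible in the diff. The helper names `set_model_type` and `changed_keys` are illustrative assumptions, not part of the repository or of any library.

```python
import json

# Hypothetical excerpt of config.json before the commit.
# Only the keys visible in the diff are included; the real file has more.
OLD_CONFIG = """{
  "mm_vision_select_layer": -2,
  "mm_vision_tower": "google/siglip-so400m-patch14-384",
  "mm_vision_tower_lr": 2e-06,
  "model_type": "qwen2",
  "num_attention_heads": 28,
  "num_hidden_layers": 28,
  "num_key_value_heads": 4
}"""


def set_model_type(config_text: str, new_type: str) -> dict:
    """Parse the JSON text and return a dict with model_type replaced."""
    config = json.loads(config_text)
    config["model_type"] = new_type
    return config


def changed_keys(old: dict, new: dict) -> list:
    """Return the sorted keys whose values differ between two configs."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))


old = json.loads(OLD_CONFIG)
new = set_model_type(OLD_CONFIG, "llava")

# Confirm that the edit matches the +1 -1 reported for config.json.
print(changed_keys(old, new))  # ['model_type']
```

Comparing the parsed dictionaries rather than the raw text keeps the check independent of key order and whitespace, which a JSON serializer may alter when the file is rewritten.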