Tongyi-ConvAI
/

MMEvol-Qwen2-7B

Visual Question Answering

Model card Files Files and versions Community

MMEvol-Qwen2-7B / README.md

haonanzhang's picture

Update README.md

7fac8cd verified 28 days ago

|

history blame contribute delete

1.74 kB

metadata

license: apache-2.0
language:
  - en
metrics:
  - accuracy
base_model:
  - Qwen/Qwen2-VL-7B-Instruct
pipeline_tag: visual-question-answering

MMEvol Model Card

Model Details

Here are the pretrained weights and instruction tuning weights

Model	Pretrained Projector	Base LLM	PT Data	IT Data	Download
MMEvol-Qwen2-7B	mm_projector	Qwen2-7B	LLaVA-Pretrain	MMEvol	ckpt

Performance

VLMEvalKit Support (OpenCompass)

Model	MME_C	MMStar	HallBench	MathVista_mini	MMMU_val	AI2D	POPE	BLINK	RWQA
MMEvol-Qwen2-7B	55.8	51.6	64.1	52.4	45.1	74.7	87.8	47.7	63.9

VLMEvalKit Not Support (VQADataSet)

Model	VQA_v2	GQA	MIA	MMSInst
MMEvol-Qwen2-7B	83.1	65.5	77.6	41.8

Paper or resources for more information

Page: https://mmevol.github.io/
arXiv: https://arxiv.org/pdf/2409.05840

License

Llama 3 is licensed under the LLAMA 3 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.

Contact us if you have any questions

Run Luo — [email protected]
Haonan Zhang — [email protected]