File size: 1,738 Bytes
ce04e1d 067b487 ce04e1d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 |
---
license: apache-2.0
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2-VL-7B-Instruct
pipeline_tag: visual-question-answering
---
# MMEvol Model Card
## Model Details
Here are the pretrained weights and instruction tuning weights
| Model | Pretrained Projector | Base LLM | PT Data | IT Data | Download |
| ---------------- | -------------------- | --------- | ------------------------------------------------------------ | ------- | -------- |
| MMEvol-Qwen2-7B | [mm_projector](https://huggingface.co/Tongyi-ConvAI/MMEvol-Qwen2-7B/tree/main) | Qwen2-7B | [LLaVA-Pretrain](https://huggingface.co/datasets/liuhaotian/LLaVA-Pretrain) | MMEvol | [ckpt](https://huggingface.co/Tongyi-ConvAI/MMEvol-Qwen2-7B/tree/main) |
## Performance
### VLMEvalKit Support (OpenCompass)
| Model | MME_C | MMStar | HallBench | MathVista_mini | MMMU_val | AI2D | POPE | BLINK | RWQA |
| ---------------- | ----- | ------ | --------- | -------------- | -------- | ---- | ---- | ----- | ---- |
| MMEvol-Qwen2-7B | 55.8 | 51.6 | 64.1 | 52.4 | 45.1 | 74.7 | 87.8 | 47.7 | 63.9 |
### VLMEvalKit Not Support (VQADataSet)
| Model | VQA_v2 | GQA | MIA | MMSInst |
| ---------------- | ------ | ---- | ---- | ------- |
| MMEvol-Qwen2-7B | 83.1 | 65.5 | 77.6 | 41.8 |
## Paper or resources for more information
- Page: https://mmevol.github.io/
- arXiv: https://arxiv.org/pdf/2409.05840
## License
Llama 3 is licensed under the LLAMA 3 Community License,
Copyright (c) Meta Platforms, Inc. All Rights Reserved.
## Contact us if you have any questions
- Run Luo — [email protected]
- Haonan Zhang — [email protected]
|