pmod_llava_llama
JungleGym committed
Commit 4c09c38
1 Parent(s): a5bfe56

Update README.md

Files changed (1)
  1. README.md +20 -3
README.md CHANGED
@@ -1,3 +1,20 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ base_model:
+ - lmsys/vicuna-7b-v1.5
+ - openai/clip-vit-large-patch14-336
+ ---
+
+ # p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
+ This is the official model checkpoint of [p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay](https://arxiv.org/abs/2412.04449).
+ Please refer to [this repository](https://github.com/MCG-NJU/p-MoD) for our code.
+
+ ## Model Description
+ This model is pretrained on [LCS-558K](https://huggingface.co/datasets/liuhaotian/LLaVA-Pretrain) image caption data, and instruction-tuned on [779K LLaVA-NeXT instruction data](https://huggingface.co/datasets/lmms-lab/LLaVA-NeXT-Data).
+
+ ## Citation
+ TBD
+
+ ## License
+ Llama 2 is licensed under the LLAMA 2 Community License,
+ Copyright (c) Meta Platforms, Inc. All Rights Reserved.
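
The updated card points to the p-MoD GitHub repository for inference code but does not show how to fetch the weights. Below is a minimal sketch using the standard `huggingface_hub` download API; the repo id `JungleGym/pmod_llava_llama` is an assumption inferred from this page's header, and actually running the model requires the p-MoD code from the linked repository.

```python
# Minimal sketch: download the p-MoD checkpoint files locally.
# Assumption: the repo id "JungleGym/pmod_llava_llama" is inferred from this page's header.
# Running inference additionally requires the p-MoD code from https://github.com/MCG-NJU/p-MoD.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="JungleGym/pmod_llava_llama")
print(f"Checkpoint downloaded to: {local_dir}")
```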