mtgv
/

VisionLLaMA-Base-MAE

Image Classification

Model card Files Files and versions

mtgv commited on Mar 12, 2024

Commit

6dc0a2f

·

verified ·

1 Parent(s): 211026e

Update README.md

Files changed (1) hide show

README.md +12 -2

README.md CHANGED Viewed

@@ -8,6 +8,7 @@ metrics:
 - mIoU
 pipeline_tag: image-classification
 ---
 # VisionLLaMA-Base-MAE
 With the Masked Autoencoders' paradigm, VisionLLaMA-Base-MAE model is trained on ImageNet-1k without labels. It manifests substantial improvements over classification tasks (SFT, linear probing) on ImageNet-1K and the segmentation task on ADE20K.
@@ -18,8 +19,17 @@ With the Masked Autoencoders' paradigm, VisionLLaMA-Base-MAE model is trained on
 | VisionLLaMA-Base-MAE (ep1600) |84.3 | 71.7| 50.2 |
-# How to Use
-Please refer the [Github](https://github.com/Meituan-AutoML/VisionLLaMA) page for usage.

 - mIoU
 pipeline_tag: image-classification
 ---
 # VisionLLaMA-Base-MAE
 With the Masked Autoencoders' paradigm, VisionLLaMA-Base-MAE model is trained on ImageNet-1k without labels. It manifests substantial improvements over classification tasks (SFT, linear probing) on ImageNet-1K and the segmentation task on ADE20K.
 | VisionLLaMA-Base-MAE (ep1600) |84.3 | 71.7| 50.2 |
+# How to Use
+Please refer the [Github](https://github.com/Meituan-AutoML/VisionLLaMA) page for usage.
+# Citation
+```
+@article{chu2024visionllama,
+  title={VisionLLaMA: A Unified LLaMA Interface for Vision Tasks},
+  author={Chu, Xiangxiang and Su, Jianlin and Zhang, Bo and Shen, Chunhua},
+  journal={arXiv preprint arXiv:2403.00522},
+  year={2024}
+}
+```