|
--- |
|
license: mit |
|
language: |
|
- en |
|
library_name: transformers |
|
inference: False |
|
--- |
|
## Sharded BLIP-2 Model Card |
|
|
|
This is a sharded version of the [BLIP-2 Model Card](https://huggingface.co/models/Salesforce/blip2-flan-t5-xl) which leverages [Flan T5-xl](https://huggingface.co/google/flan-t5-xl) for image-to-text tasks such as image captioning and visual question answering. |
|
|
|
Refer to the [original model card](https://huggingface.co/models/Salesforce/blip2-flan-t5-xl) for more details about the model description, intended uses, and limitations, as well as instructions for how to use the model on CPU and GPU in different precisions. |