metadata
license: mit
language:
- en
library_name: transformers
inference: false
Sharded BLIP-2 Model Card
This is a sharded version of the BLIP-2 Model Card which leverages Flan T5-xl for image-to-text tasks such as image captioning and visual question answering.
Refer to the original model card for more details about the model description, intended uses, and limitations, as well as instructions for how to use the model on CPU and GPU in different precisions.