groma-7b-finetune / README.md
---
datasets:
  - FoundationVision/groma_instruct
language:
  - en
pipeline_tag: image-text-to-text
library_name: transformers
---

This repository contains the model from the paper *Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models*.
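As a minimal sketch of fetching the checkpoint files locally, the snippet below uses `snapshot_download` from the `huggingface_hub` package. The repository id `FoundationVision/groma-7b-finetune` is an assumption: the owner namespace is inferred from the dataset entry in the metadata, and the helper name `download_checkpoint` is hypothetical. This card does not say whether the weights load through stock `transformers` classes, so the sketch stops at downloading.

```python
from typing import Optional

# Assumed repository id: the owner namespace is inferred from the dataset
# entry in the metadata and is not stated in this card.
REPO_ID = "FoundationVision/groma-7b-finetune"


def download_checkpoint(repo_id: str = REPO_ID, local_dir: Optional[str] = None) -> str:
    """Download every file in the model repository and return the local path.

    Hypothetical helper for illustration; it only wraps snapshot_download.
    """
    # Imported lazily so this module can be imported without huggingface_hub.
    from huggingface_hub import snapshot_download

    # With local_dir=None the files land in the default Hugging Face cache.
    return snapshot_download(repo_id=repo_id, local_dir=local_dir)
```

Calling `download_checkpoint()` requires network access and the `huggingface_hub` package; the returned path points at the downloaded snapshot directory.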