File size: 286 Bytes
05fa730 |
1 2 3 4 5 6 |
---
pipeline_tag: image-text-to-text
---
This repository contains the Elva-Phi3-3.8B model presented in [On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning](https://huggingface.co/papers/2406.11823).
|