File size: 894 Bytes
74b944d 4b2d5d2 4c492d6 74b944d 7c09547 d080507 83cb829 e71c8dc 83cb829 bb51ecc |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
---
title: VLM Demo
sdk: docker
license: mit
---
This demo illustrates the work published in the paper ["Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models"](https://arxiv.org/pdf/2402.07865.pdf)
# Source code
For more information, please refer to this repository:
> *VLM Demo*: Lightweight repo for chatting with VLMs supported by our
[VLM Evaluation Suite](https://github.com/TRI-ML/vlm-evaluation/tree/main).
# Huffing Face Space architecture
Hugging Face Space build a container image based on the `Dockerfile`. In this file, we use the base Nvidia base image and install additional packages and external repositories.
The Hugging Face Space start the container and execute `startup.sh`. The script loads each model on a separate GPU of the 4xA10G. Then it launches several processes: one for each model, the Gradio API controller and frontend. |