This model contains the weights of NExT-GPT covering text-image-video-audio (tiva), which is built upon
- Vicuna-7B with version 0
- Stable Diffusion with version
v1-5
.
- Stable Diffusion with version
- AudioLDM with version
l-full
.
- AudioLDM with version
- ZeroScope with version
v2_576w
.
- ZeroScope with version
For more details about the usage of the model, please refer to our code repository.
- Downloads last month
- 117
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.