# ViPER-VT
## (Vision Text)

This repository contains the checkpoints for the ViPER model. It is a Perceiver-based model fine-tuned on the concatenation of visual and textual features.

For more information on how to use this model, please refer to the following [repository](https://github.com/VaianiLorenzo/ViPER).

If you find this useful, please cite:

```
@inproceedings{vaiani2022viper,
  title={ViPER: Video-based Perceiver for Emotion Recognition},
  author={Vaiani, Lorenzo and La Quatra, Moreno and Cagliero, Luca and Garza, Paolo},
  booktitle={Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge},
  pages={67--73},
  year={2022}
}
```

For any other questions, feel free to contact me at lorenzo.vaiani@polito.it
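
As a rough illustration of what "concatenation of visual and textual features" can mean, here is a minimal sketch. It is not the code used to train these checkpoints: the function name `concat_features`, the toy feature sizes, and the choice to join the two vectors per frame along the feature axis are all assumptions made for this example. Please see the linked repository for the actual pipeline.

```python
from typing import List, Sequence


def concat_features(
    visual: Sequence[Sequence[float]],
    text: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Join per-frame visual and textual feature vectors (hypothetical helper).

    Assumes both inputs hold one feature vector per frame, in the same order.
    """
    if len(visual) != len(text):
        raise ValueError("visual and text must have the same number of frames")
    # One fused vector per frame: visual features first, then textual ones.
    return [list(v) + list(t) for v, t in zip(visual, text)]


# Toy inputs (made up): 2 frames, 3 visual dimensions, 2 textual dimensions.
visual = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
text = [[1.0, 2.0], [3.0, 4.0]]

fused = concat_features(visual, text)
print(len(fused), len(fused[0]))  # prints: 2 5
```

Under these assumptions, each frame ends up with a single vector whose length is the sum of the two feature sizes, which is the kind of input a Perceiver-style model could then attend over.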