README.md · keras-io/video-vision-transformer at 55e85e67c1afd067932ec8dc415ef693b7fe18d8

metadata

title: Video Vision Transformer on medmnist
emoji: 🧑‍⚕️
colorFrom: red
colorTo: green
sdk: gradio
app_file: app.py
pinned: false
license: apache-2.0
library_name: keras

Keras Implementation of Video Vision Transformer on medmnist

This repo contains the model to this Keras example on Video Vision Transformer.

Background Information

This example implements ViViT: A Video Vision Transformer by Arnab et al., a pure Transformer-based model for video classification. The authors propose a novel embedding scheme and a number of Transformer variants to model video clips.

Datasets

We use the MedMNIST v2: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification dataset.