Model card for mr_vit_base_patch16_224_timm_pretrain_railspace_and_building

A Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224, and fine-tuned on ImageNet 2012 (1 million images, 1,000 classes) at resolution 224x224. Fine-tuned on gold standard annotations and outputs from early experiments using MapReader (found here).

Model Details

Model Description

Model type: Image classification /feature backbone
Finetuned from model: https://huggingface.co/google/vit-base-patch16-224

Classes and labels

0: no
1: railspace
2: building
3: railspace & building

Uses

This fine-tuned version of the model is an output of the MapReader pipeline. It was used to classify 'patch' images (cells/regions) of scanned nineteenth-century series maps of Britain provided by the National Library of Scotland (learn more here). We classified patches to indicate the presence of buildings and railway infrastructure. See our paper for more details about labels.

How to Get Started with the Model in MapReader

Please go to the MapReader documentation for instructions on how to use this model in MapReader.

Training, Evaluation and Testing Details

Training, Evaluation and Testing Data

This model was fine-tuned on manually-annotated data.

Training, Evaluation and Testing Procedure

Details can be found here.

Open access version of the article available here.

Results

Data outputs can be found here.

Further details can be found here.

More Information

This model was fine-tuned using MapReader.

The code for MapReader can be found here and the documentation can be found here.

Model Card Authors

Katie McDonough ([email protected])
Rosie Wood ([email protected])

Model Card Contact

Katie McDonough ([email protected])

Funding Statement

This work was supported by Living with Machines (AHRC grant AH/S01179X/1) and The Alan Turing Institute (EPSRC grant EP/N510129/1). Living with Machines, funded by the UK Research and Innovation (UKRI) Strategic Priority Fund, is a multidisciplinary collaboration delivered by the Arts and Humanities Research Council (AHRC), with The Alan Turing Institute, the British Library and Cambridge, King's College London, East Anglia, Exeter, and Queen Mary University of London.

Livingwithmachines
/

mr_vit_base_patch16_224_timm_pretrain_railspace_and_building