SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models
Abstract
Advancing AI in computational pathology requires large, high-quality, and diverse datasets, yet existing public datasets are often limited in organ diversity, class coverage, or annotation quality. To bridge this gap, we introduce SPIDER (Supervised Pathology Image-DEscription Repository), the largest publicly available patch-level dataset covering multiple organ types, including Skin, Colorectal, and Thorax, with comprehensive class coverage for each organ. SPIDER provides high-quality annotations verified by expert pathologists and includes surrounding context patches, which enhance classification performance by providing spatial context. Alongside the dataset, we present baseline models trained on SPIDER using the Hibou-L foundation model as a feature extractor combined with an attention-based classification head. The models achieve state-of-the-art performance across multiple tissue categories and serve as strong benchmarks for future digital pathology research. Beyond patch classification, the model enables rapid identification of significant areas, quantitative tissue metrics, and establishes a foundation for multimodal approaches. Both the dataset and trained models are publicly available to advance research, reproducibility, and AI-driven pathology development. Access them at: https://github.com/HistAI/SPIDER
Community
New dataset, what do you think?
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks (2025)
- Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics (2025)
- A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images (2025)
- "No negatives needed": weakly-supervised regression for interpretable tumor detection in whole-slide histopathology images (2025)
- Can We Simplify Slide-level Fine-tuning of Pathology Foundation Models? (2025)
- Unveiling Institution-Specific Bias in Pathology Foundation Models: Detriments, Causes, and Potential Solutions (2025)
- Weakly Supervised Pixel-Level Annotation with Visual Interpretability (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 3
Datasets citing this paper 3
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper