|
--- |
|
license: mit |
|
tags: |
|
- biology |
|
--- |
|
|
|
Pre-trained single-cell genomics models based on: |
|
- [BarlowTwins](https://proceedings.mlr.press/v139/zbontar21a/zbontar21a.pdf) |
|
- [Bootstrap Your Own Latent](https://papers.nips.cc/paper/2020/file/f3ada80d5c4ee70142b17b8192b2958e-Paper.pdf) |
|
- [Masked Autoencoder](https://openaccess.thecvf.com/content/CVPR2022/papers/He_Masked_Autoencoders_Are_Scalable_Vision_Learners_CVPR_2022_paper.pdf) |
|
- Gene-Program Masked Autoencoder |
|
|
|
Finetuned models for the downstream tasks of: |
|
- Cell Type Prediction |
|
- Gene Expression Reconstruction |
|
- Cross-Modality Prediction (RNA->Proteomics) |
|
- Data Integration |
|
|
|
Training details and adaptations to single-cell data in our project can be found in our paper below. To use the model directly, the same genes must be used in the same order as in the `var.parquet` file. Otherwise, follow the instructions from the repositories below to train a model for custom datasets. |
|
|
|
If you find our work useful, please cite the following paper: |
|
|
|
[**Delineating the Effective Use of Self-Supervised Learning in Single-Cell Genomics**](https://doi.org/10.1101/2024.02.16.580624) |
|
|
|
See also: |
|
|
|
[Repository of the full analysis](https://github.com/theislab/ssl_in_scg) |
|
|
|
[Lean repository for minimal pre-training](https://github.com/theislab/sc_mae) |