HuangLab
/

CELL-E_2_HPA_2560

Model card Files Files and versions Community

Emaad commited on Oct 1, 2023

Commit

25f3aef

•

1 Parent(s): 75234fe

Update README.md

Files changed (1) hide show

README.md +10 -8

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ metrics:
 # CELL-E 2
 ## Model description
-[![CELL-E_2](images/architecture.png)](https://github.com/BoHuangLab/CELL-E_2)
 CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
@@ -31,7 +31,7 @@ We have two spaces available where you can run predictions on your own data!
 - [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
 - [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
 ## Model variations
 We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
@@ -68,6 +68,7 @@ These models were used the HPA models as checkpoints, but then were finetuned on
 | [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
 | [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
 ### How to use
@@ -85,12 +86,13 @@ model.sample(text=sequence, condition=nucleus)
 ### BibTeX entry and citation info
 ```bibtex
-@article{,
- author = {Emaad Khwaja and
- Yun S Song and
- Aaron Agarunov and
- Bo Huang},
- title = {{CELL-E 2:} Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transforme},
 }
 ```

 # CELL-E 2
 ## Model description
+[![CELL-E_2](images/architecture.png)](https://bohuanglab.github.io/CELL-E_2/)
 CELL-E 2 is the second iteration of the original [CELL-E](https://www.biorxiv.org/content/10.1101/2022.05.27.493774v1) model which utilizes an amino acid sequence and nucleus image to make predictions of subcellular protein localization with respect to the nucleus.
 - [Image Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Image_Prediction)
 - [Sequence Prediction](https://huggingface.co/spaces/HuangLab/CELL-E_2-Sequence_Prediction)
 ## Model variations
 We have made several versions of CELL-E 2 available. The naming scheme follows the structure ```training set_hidden size``` where the hidden size is set to the embedding dimension of the pretrained ESM-2 model.
 | [`HPA_1280`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_1280) | 10.8 GB | |
 | [`HPA_2560`](https://huggingface.co/HuangLab/CELL-E_2_HPA_Finetuned_2560) | 17.5 GB | |
+To reduce download size, we removed the ESM-2 model from the checkpoint. This should be downloaded the first time the code is run, but is otherwise something to be aware of if loading into other projects.
 ### How to use
 ### BibTeX entry and citation info
 ```bibtex
+@inproceedings{
+anonymous2023translating,
+title={CELL-E 2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer},
+author={Emaad Khwaja, Yun S. Song, Aaron Agarunov, and Bo Huang},
+booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
+year={2023},
+url={https://openreview.net/forum?id=YSMLVffl5u}
 }
 ```