---
tags:
- esm
- protein language model
- protein sequence annotation
license: cc-by-4.0
language:
- en
pipeline_tag: token-classification
---

# Model Card for PSALM-1-clan

**NOTE:** PSALM-1 has not been trained on all Pfam families. It was trained and benchmarked on highly curated datasets with strict sequence-similarity guarantees between train and test data. **PSALM-1b (trained on all families in Pfam 35.0) is coming soon.**

PSALM-1-clan is a 69M-parameter model that takes unpooled, residue-level ESM-2 protein sequence embeddings as input and outputs a distribution over Pfam domain clans for each amino acid in the sequence. PSALM-1-clan uses a BiLSTM followed by a stack of dense layers.

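To illustrate what a per-residue distribution over clans looks like in practice, here is a minimal sketch in plain Python. It assumes the final dense layer yields one raw score per clan for each residue and that a softmax turns those scores into probabilities; the label names, the scores, and the helper functions `softmax` and `label_residues` are hypothetical and are not part of the PSALM code.

```python
import math

# Hypothetical label set; the real clan vocabulary is not shown in this card.
CLAN_LABELS = ["clan_A", "clan_B", "clan_C"]


def softmax(scores):
    """Turn one residue's raw scores into a probability distribution."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def label_residues(per_residue_scores, labels=CLAN_LABELS):
    """Return (label, probability) for the most likely clan at each residue."""
    calls = []
    for scores in per_residue_scores:
        probs = softmax(scores)
        best = max(range(len(probs)), key=probs.__getitem__)
        calls.append((labels[best], probs[best]))
    return calls


# Made-up scores for a 4-residue sequence (one row per amino acid,
# one column per clan). These are illustrative values, not model output.
example_scores = [
    [2.0, 0.1, -1.0],
    [1.5, 0.3, -0.5],
    [0.0, 3.0, 0.2],
    [-1.0, 0.5, 2.5],
]

calls = label_residues(example_scores)
for position, (clan, prob) in enumerate(calls, start=1):
    print(f"residue {position}: {clan} (p={prob:.2f})")
```

Taking the argmax at each position is only one way to read the output; keeping the full distribution preserves the model's uncertainty, which can matter near domain boundaries.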