metadata
tags:
- esm
- protein language model
- protein sequence annotation
license: cc-by-4.0
language:
- en
pipeline_tag: token-classification
Model Card for Model ID
NOTE: PSALM-1 has not been trained on all Pfam families, as it has been trained and benchmarked on highly-curated datasets with strict sequence similarity guarantees between train and test data. PSALM-1b (trained on all families in Pfam 35.0) is coming soon
PSALM-1-clan is a 69M parameter model that takes as input ESM-2 residue-level protein sequence emebeddings (unpooled) and outputs a distribution over Pfam domain clans for each amino acid in the sequence. PSALM-1-clan uses a BiLSTM followed by a stack of dense layers