--- tags: - esm - protein language model - protein sequence annotation license: cc-by-4.0 language: - en pipeline_tag: token-classification --- # Model Card for Model ID **NOTE:** PSALM-1 has not been trained on all Pfam families, as it has been trained and benchmarked on highly-curated datasets with strict sequence similarity guarantees between train and test data. **PSALM-1b (trained on all families in Pfam 35.0) is coming soon** PSALM-1-clan is a 69M parameter model that takes as input ESM-2 residue-level protein sequence emebeddings (unpooled) and outputs a distribution over Pfam domain clans for each amino acid in the sequence. PSALM-1-clan uses a BiLSTM followed by a stack of dense layers