--- license: cc-by-nc-nd-4.0 extra_gated_fields: Name: text Company: text Country: country Specific date: date_picker I want to use this model for: type: select options: - Research - Education - label: Other value: other I agree to share generated sequences and associated data with authors before publishing: checkbox I agree not to file patents on any sequences generated by this model: checkbox I agree to use this model for non-commercial use ONLY: checkbox --- # Masked Diffusion Language Model (MDLM) for Protein Sequence Generation MDLM, introduced by Sahoo et al (arxiv.org/pdf/2406.07524), provides strong generative capabilities to BERT-style models. In this work, we pre-train and fine-tune ESM-2-150M on the MDLM objective to unconditionally generate realistic, high-quality membrane protein sequences.