--- library_name: transformers tags: [] --- # Model Card for se-bert Provides Generic Software Engineering LM from "Enhancing Automated Software Traceability by Transfer Learning from Open-World Data" ## Model Details The following language models is trained on the Git Corpus and Git Links from 2016 to 2021. The data contains 4 types of records including Comments, Issues, Pull Requests, and Commits. ## Uses This model is intended to be a good set of starting weights for various software engineering tasks including: - requirements classification - traceability link prediction - retrieval / search ## Training, Evaluation, and Results Please see cited paper for complete details on training method. ## Technical Specifications ### Model Architecture and Objective MLM model trained on SE Corpus (See Above). #### Hardware 1 GPU with CUDA 10.2 or 11.1 #### Software Python >= 3.7 pytorch/1.1.0 ## Citation [optional] **BibTeX:** @misc{lin2022enhancing, title={Enhancing Automated Software Traceability by Transfer Learning from Open-World Data}, author={Jinfeng Lin and Amrit Poudel and Wenhao Yu and Qingkai Zeng and Meng Jiang and Jane Cleland-Huang}, year={2022}, eprint={2207.01084}, archivePrefix={arXiv}, primaryClass={cs.SE} } ## Model Card Authors [optional] Jinfeng Lin, Amrit Poudel, Wenhao Yu, Qingkai Zeng, Jane Cleland-Huang ## Model Card Contact Alberto Rodriguez (arodri39@nd.edu)