# Model Card for GPT_2_CODE WIP, Goal is to create a small GPT2 python coder # Table of Contents - [Model Card for GPT_2_CODE](#model-card-for--model_id-) - [Table of Contents](#table-of-contents) - [Table of Contents](#table-of-contents-1) - [Model Details](#model-details) - [Model Description](#model-description) - [Uses](#uses) - [Direct Use](#direct-use) - [Downstream Use [Optional]](#downstream-use-optional) - [Out-of-Scope Use](#out-of-scope-use) - [Bias, Risks, and Limitations](#bias-risks-and-limitations) - [Recommendations](#recommendations) - [Training Details](#training-details) - [Training Data](#training-data) - [Training Procedure](#training-procedure) - [Preprocessing](#preprocessing) - [Speeds, Sizes, Times](#speeds-sizes-times) - [Evaluation](#evaluation) - [Testing Data, Factors & Metrics](#testing-data-factors--metrics) - [Testing Data](#testing-data) - [Factors](#factors) - [Metrics](#metrics) - [Results](#results) - [Model Examination](#model-examination) - [Environmental Impact](#environmental-impact) - [Technical Specifications [optional]](#technical-specifications-optional) - [Model Architecture and Objective](#model-architecture-and-objective) - [Compute Infrastructure](#compute-infrastructure) - [Hardware](#hardware) - [Software](#software) - [Citation](#citation) - [Glossary [optional]](#glossary-optional) - [More Information [optional]](#more-information-optional) - [Model Card Authors [optional]](#model-card-authors-optional) - [Model Card Contact](#model-card-contact) - [How to Get Started with the Model](#how-to-get-started-with-the-model) # Model Details ## Model Description WIP, Goal is to create a small GPT2 python coder - **Developed by:** C, o, d, e, M, o, n, k, e, y - **Shared by [Optional]:** More information needed - **Model type:** Language model - **Language(s) (NLP):** eng - **License:** wtfpl - **Parent Model:** More information needed - **Resources for more information:** More information needed - [GitHub Repo](None) - [Associated Paper](None) # Uses ## Direct Use generate python code snippets ## Downstream Use [Optional] semi auto coder ## Out-of-Scope Use describe code # Bias, Risks, and Limitations Significant research has explored bias and fairness issues with language models (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)). Predictions generated by the model may include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups. ## Recommendations Keep Finetuning on question/python datasets # Training Details ## Training Data flytech/python-codes-25k espejelomar/code_search_net_python_10000_examples ## Training Procedure ### Preprocessing More information needed ### Speeds, Sizes, Times Epochs 3 flytech/python-codes-25k (4600) Training Loss: 0.4007 Validation Loss: 0.5526 Epochs 3 espejelomar/code_search_net_python_10000_examples (4800) Training Loss: 1.5355 Validation Loss: 1.1723 # Evaluation ## Testing Data, Factors & Metrics ### Testing Data flytech/python-codes-25k espejelomar/code_search_net_python_10000_examples ### Factors 80/20 train/val ### Metrics train/validate lr scheduling ## Results Better in python code generation as base gpt2-medium model # Model Examination More information needed # Environmental Impact Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). - **Hardware Type:** CPU and Colab T4 - **Hours used:** 4 - **Cloud Provider:** Google Colab - **Compute Region:** NL - **Carbon Emitted:** ??? # Technical Specifications [optional] ## Model Architecture and Objective gpt2 ## Compute Infrastructure More information needed ### Hardware CPU and Colab T4 ### Software pytorch, custom python # Citation **BibTeX:** More information needed **APA:** More information needed # Glossary [optional] More information needed # More Information [optional] Experimental # Model Card Authors [optional] CodeMonkeyXL # Model Card Contact K00B404 huggingface # How to Get Started with the Model Use the code below to get started with the model.
Click to expand More information needed