New model from https://wandb.ai/wandb/huggingtweets/runs/226pt34g
Browse files- README.md +85 -0
- config.json +42 -0
- merges.txt +0 -0
- pytorch_model.bin +3 -0
- special_tokens_map.json +1 -0
- tokenizer.json +0 -0
- tokenizer_config.json +1 -0
- training_args.bin +3 -0
- vocab.json +0 -0
    	
        README.md
    ADDED
    
    | @@ -0,0 +1,85 @@ | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            ---
         | 
| 2 | 
            +
            language: en
         | 
| 3 | 
            +
            thumbnail: https://github.com/borisdayma/huggingtweets/blob/master/img/logo.png?raw=true
         | 
| 4 | 
            +
            tags:
         | 
| 5 | 
            +
            - huggingtweets
         | 
| 6 | 
            +
            widget:
         | 
| 7 | 
            +
            - text: "My dream is"
         | 
| 8 | 
            +
            ---
         | 
| 9 | 
            +
             | 
| 10 | 
            +
            <div class="inline-flex flex-col" style="line-height: 1.5;">
         | 
| 11 | 
            +
                <div class="flex">
         | 
| 12 | 
            +
                    <div
         | 
| 13 | 
            +
            			style="display:inherit; margin-left: 4px; margin-right: 4px; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url('https://pbs.twimg.com/profile_images/1374924360780242944/-Q8NfgEr_400x400.jpg')">
         | 
| 14 | 
            +
                    </div>
         | 
| 15 | 
            +
                    <div
         | 
| 16 | 
            +
                        style="display:inherit; margin-left: 4px; margin-right: 4px; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url('https://pbs.twimg.com/profile_images/1386749605216407555/QIJeyWfE_400x400.jpg')">
         | 
| 17 | 
            +
                    </div>
         | 
| 18 | 
            +
                    <div
         | 
| 19 | 
            +
                        style="display:inherit; margin-left: 4px; margin-right: 4px; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url('https://pbs.twimg.com/profile_images/1391882949650440200/lmEKl2ZQ_400x400.jpg')">
         | 
| 20 | 
            +
                    </div>
         | 
| 21 | 
            +
                </div>
         | 
| 22 | 
            +
                <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">🤖 AI CYBORG 🤖</div>
         | 
| 23 | 
            +
                <div style="text-align: center; font-size: 16px; font-weight: 800">wintbot_neo & wint but Al & AI Hexcrawl</div>
         | 
| 24 | 
            +
                <div style="text-align: center; font-size: 14px;">@ai_hexcrawl-dril_gpt2-drilbot_neo</div>
         | 
| 25 | 
            +
            </div>
         | 
| 26 | 
            +
             | 
| 27 | 
            +
            I was made with [huggingtweets](https://github.com/borisdayma/huggingtweets).
         | 
| 28 | 
            +
             | 
| 29 | 
            +
            Create your own bot based on your favorite user with [the demo](https://colab.research.google.com/github/borisdayma/huggingtweets/blob/master/huggingtweets-demo.ipynb)!
         | 
| 30 | 
            +
             | 
| 31 | 
            +
            ## How does it work?
         | 
| 32 | 
            +
             | 
| 33 | 
            +
            The model uses the following pipeline.
         | 
| 34 | 
            +
             | 
| 35 | 
            +
            
         | 
| 36 | 
            +
             | 
| 37 | 
            +
            To understand how the model was developed, check the [W&B report](https://wandb.ai/wandb/huggingtweets/reports/HuggingTweets-Train-a-Model-to-Generate-Tweets--VmlldzoxMTY5MjI).
         | 
| 38 | 
            +
             | 
| 39 | 
            +
            ## Training data
         | 
| 40 | 
            +
             | 
| 41 | 
            +
            The model was trained on tweets from wintbot_neo & wint but Al & AI Hexcrawl.
         | 
| 42 | 
            +
             | 
| 43 | 
            +
            | Data | wintbot_neo | wint but Al | AI Hexcrawl |
         | 
| 44 | 
            +
            | --- | --- | --- | --- |
         | 
| 45 | 
            +
            | Tweets downloaded | 3207 | 3198 | 737 |
         | 
| 46 | 
            +
            | Retweets | 268 | 41 | 26 |
         | 
| 47 | 
            +
            | Short tweets | 272 | 49 | 1 |
         | 
| 48 | 
            +
            | Tweets kept | 2667 | 3108 | 710 |
         | 
| 49 | 
            +
             | 
| 50 | 
            +
            [Explore the data](https://wandb.ai/wandb/huggingtweets/runs/2g9pfbo8/artifacts), which is tracked with [W&B artifacts](https://docs.wandb.com/artifacts) at every step of the pipeline.
         | 
| 51 | 
            +
             | 
| 52 | 
            +
            ## Training procedure
         | 
| 53 | 
            +
             | 
| 54 | 
            +
            The model is based on a pre-trained [GPT-2](https://huggingface.co/gpt2) which is fine-tuned on @ai_hexcrawl-dril_gpt2-drilbot_neo's tweets.
         | 
| 55 | 
            +
             | 
| 56 | 
            +
            Hyperparameters and metrics are recorded in the [W&B training run](https://wandb.ai/wandb/huggingtweets/runs/226pt34g) for full transparency and reproducibility.
         | 
| 57 | 
            +
             | 
| 58 | 
            +
            At the end of training, [the final model](https://wandb.ai/wandb/huggingtweets/runs/226pt34g/artifacts) is logged and versioned.
         | 
| 59 | 
            +
             | 
| 60 | 
            +
            ## How to use
         | 
| 61 | 
            +
             | 
| 62 | 
            +
            You can use this model directly with a pipeline for text generation:
         | 
| 63 | 
            +
             | 
| 64 | 
            +
            ```python
         | 
| 65 | 
            +
            from transformers import pipeline
         | 
| 66 | 
            +
            generator = pipeline('text-generation',
         | 
| 67 | 
            +
                                 model='huggingtweets/ai_hexcrawl-dril_gpt2-drilbot_neo')
         | 
| 68 | 
            +
            generator("My dream is", num_return_sequences=5)
         | 
| 69 | 
            +
            ```
         | 
| 70 | 
            +
             | 
| 71 | 
            +
            ## Limitations and bias
         | 
| 72 | 
            +
             | 
| 73 | 
            +
            The model suffers from [the same limitations and bias as GPT-2](https://huggingface.co/gpt2#limitations-and-bias).
         | 
| 74 | 
            +
             | 
| 75 | 
            +
            In addition, the data present in the user's tweets further affects the text generated by the model.
         | 
| 76 | 
            +
             | 
| 77 | 
            +
            ## About
         | 
| 78 | 
            +
             | 
| 79 | 
            +
            *Built by Boris Dayma*
         | 
| 80 | 
            +
             | 
| 81 | 
            +
            [](https://twitter.com/intent/follow?screen_name=borisdayma)
         | 
| 82 | 
            +
             | 
| 83 | 
            +
            For more details, visit the project repository.
         | 
| 84 | 
            +
             | 
| 85 | 
            +
            [](https://github.com/borisdayma/huggingtweets)
         | 
    	
        config.json
    ADDED
    
    | @@ -0,0 +1,42 @@ | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            {
         | 
| 2 | 
            +
              "_name_or_path": "gpt2",
         | 
| 3 | 
            +
              "activation_function": "gelu_new",
         | 
| 4 | 
            +
              "architectures": [
         | 
| 5 | 
            +
                "GPT2LMHeadModel"
         | 
| 6 | 
            +
              ],
         | 
| 7 | 
            +
              "attn_pdrop": 0.1,
         | 
| 8 | 
            +
              "bos_token_id": 50256,
         | 
| 9 | 
            +
              "embd_pdrop": 0.1,
         | 
| 10 | 
            +
              "eos_token_id": 50256,
         | 
| 11 | 
            +
              "gradient_checkpointing": false,
         | 
| 12 | 
            +
              "initializer_range": 0.02,
         | 
| 13 | 
            +
              "layer_norm_epsilon": 1e-05,
         | 
| 14 | 
            +
              "model_type": "gpt2",
         | 
| 15 | 
            +
              "n_ctx": 1024,
         | 
| 16 | 
            +
              "n_embd": 768,
         | 
| 17 | 
            +
              "n_head": 12,
         | 
| 18 | 
            +
              "n_inner": null,
         | 
| 19 | 
            +
              "n_layer": 12,
         | 
| 20 | 
            +
              "n_positions": 1024,
         | 
| 21 | 
            +
              "resid_pdrop": 0.1,
         | 
| 22 | 
            +
              "scale_attn_weights": true,
         | 
| 23 | 
            +
              "summary_activation": null,
         | 
| 24 | 
            +
              "summary_first_dropout": 0.1,
         | 
| 25 | 
            +
              "summary_proj_to_labels": true,
         | 
| 26 | 
            +
              "summary_type": "cls_index",
         | 
| 27 | 
            +
              "summary_use_proj": true,
         | 
| 28 | 
            +
              "task_specific_params": {
         | 
| 29 | 
            +
                "text-generation": {
         | 
| 30 | 
            +
                  "do_sample": true,
         | 
| 31 | 
            +
                  "max_length": 160,
         | 
| 32 | 
            +
                  "min_length": 10,
         | 
| 33 | 
            +
                  "prefix": "<|endoftext|>",
         | 
| 34 | 
            +
                  "temperature": 1.0,
         | 
| 35 | 
            +
                  "top_p": 0.95
         | 
| 36 | 
            +
                }
         | 
| 37 | 
            +
              },
         | 
| 38 | 
            +
              "torch_dtype": "float32",
         | 
| 39 | 
            +
              "transformers_version": "4.10.2",
         | 
| 40 | 
            +
              "use_cache": true,
         | 
| 41 | 
            +
              "vocab_size": 50257
         | 
| 42 | 
            +
            }
         | 
    	
        merges.txt
    ADDED
    
    | The diff for this file is too large to render. 
		See raw diff | 
|  | 
    	
        pytorch_model.bin
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:3d8dd4184725da7529d5690cdcaf64bbf023bfc67cb4c658a87204245d1231d2
         | 
| 3 | 
            +
            size 510403817
         | 
    	
        special_tokens_map.json
    ADDED
    
    | @@ -0,0 +1 @@ | |
|  | 
|  | |
| 1 | 
            +
            {"bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "unk_token": "<|endoftext|>"}
         | 
    	
        tokenizer.json
    ADDED
    
    | The diff for this file is too large to render. 
		See raw diff | 
|  | 
    	
        tokenizer_config.json
    ADDED
    
    | @@ -0,0 +1 @@ | |
|  | 
|  | |
| 1 | 
            +
            {"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 1024, "special_tokens_map_file": null, "name_or_path": "gpt2", "tokenizer_class": "GPT2Tokenizer"}
         | 
    	
        training_args.bin
    ADDED
    
    | @@ -0,0 +1,3 @@ | |
|  | |
|  | |
|  | 
|  | |
| 1 | 
            +
            version https://git-lfs.github.com/spec/v1
         | 
| 2 | 
            +
            oid sha256:75d958b5b1a9c93044cc3ac23679111bbeb7e34c041a4402f359fbc4327e27a2
         | 
| 3 | 
            +
            size 2735
         | 
    	
        vocab.json
    ADDED
    
    | The diff for this file is too large to render. 
		See raw diff | 
|  | 

