euclaise
/

gpt-neox-122m-minipile-digits

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

euclaise commited on Jun 9, 2023

Commit

af25d54

·

1 Parent(s): 8e0e52a

Update README.md

Files changed (1) hide show

README.md +15 -1

README.md CHANGED Viewed

@@ -7,4 +7,18 @@ language:
 library_name: transformers
 ---
-GPT-NeoX trained on MiniPile, for a baseline to compare my MANN models against.  Uses [NeelNanda/gpt-neox-tokenizer-digits](https://huggingface.co/NeelNanda/gpt-neox-tokenizer-digits) for tokenization.

 library_name: transformers
 ---
+GPT-NeoX trained on MiniPile, for a baseline to compare my MANN models against.  Uses [NeelNanda/gpt-neox-tokenizer-digits](https://huggingface.co/NeelNanda/gpt-neox-tokenizer-digits) for tokenization.
+The exact model configuration is as follows:
+```
+cfg = GPTNeoXConfig(
+    vocab_size = len(tokenizer),
+    hidden_size = 768,
+    intermediate_size = 768*4,
+    num_hidden_layers = 12,
+    num_attention_heads = 12,
+    tie_word_embeddings = True,
+    hidden_act = "gelu_new",
+    tokenizer = "NeelNanda/gpt-neox-tokenizer-digits"
+)
+```