keras
/

gpt2_large_en

Text Generation

Model card Files Files and versions Community

gpt2_large_en / README.md

Divyasreepat's picture

Upload folder using huggingface_hub

c62471a verified 20 days ago

|

626 Bytes

	---
	library_name: keras-hub
	---
	This is a [`GPT2` model](https://keras.io/api/keras_hub/models/gpt2) uploaded using the KerasHub library and can be used with JAX, TensorFlow, and PyTorch backends.
	Model config:
	* name: gpt2_backbone
	* trainable: True
	* vocabulary_size: 50257
	* num_layers: 36
	* num_heads: 20
	* hidden_dim: 1280
	* intermediate_dim: 5120
	* dropout: 0.1
	* max_sequence_length: 1024

	This model card has been generated automatically and should be completed by the model author. See [Model Cards documentation](https://huggingface.co/docs/hub/model-cards) for more information.