keras/llama3_instruct_8b_en

	---
	library_name: keras-hub
	---
	This is a [`Llama3` model](https://keras.io/api/keras_hub/models/llama3) uploaded using the KerasHub library. It can be used with the JAX, TensorFlow, and PyTorch backends.
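
A minimal loading sketch, not taken from the original card. It assumes the `keras-hub` package is installed, that this repository is reachable through KerasHub's `hf://` preset scheme under the handle `hf://keras/llama3_instruct_8b_en`, and that your account has access to the weights. The `generate_text` helper and the `PRESET` constant are hypothetical names introduced here for illustration.

```python
import os

# Keras reads the backend from this variable at import time, so it must be
# set before keras or keras_hub is imported. "jax" is an arbitrary choice;
# "tensorflow" and "torch" are the other backends named in this card.
os.environ.setdefault("KERAS_BACKEND", "jax")

# Assumed preset handle, built from the repository name of this card.
PRESET = "hf://keras/llama3_instruct_8b_en"


def generate_text(prompt, max_length=64):
    """Load the preset and generate a completion for `prompt`.

    keras_hub is imported lazily so that merely defining this helper does
    not trigger the (large) weight download.
    """
    import keras_hub

    causal_lm = keras_hub.models.Llama3CausalLM.from_preset(PRESET)
    return causal_lm.generate(prompt, max_length=max_length)
```

Calling `generate_text("What is Keras?")` would download the weights on first use, so expect a large download and substantial memory use for an 8B-parameter model.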
	Model config:
	* name: llama_backbone
	* trainable: True
	* vocabulary_size: 128256
	* num_layers: 32
	* num_query_heads: 32
	* hidden_dim: 4096
	* intermediate_dim: 14336
	* rope_max_wavelength: 500000.0
	* rope_scaling_factor: 1.0
	* num_key_value_heads: 8
	* layer_norm_epsilon: 1e-05
	* dropout: 0
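
As a rough sanity check on the config above, the following sketch derives the per-head dimension, the grouped-query attention ratio, and an approximate parameter count. It assumes the standard Llama layout, which the card does not state: bias-free projections, a gated feed-forward block with three weight matrices, two normalization layers per block plus a final one, and an output head that is not tied to the input embedding. The function names are illustrative.

```python
# Values copied from the model config listed in this card.
config = {
    "vocabulary_size": 128256,
    "num_layers": 32,
    "num_query_heads": 32,
    "hidden_dim": 4096,
    "intermediate_dim": 14336,
    "num_key_value_heads": 8,
}


def derived_shapes(cfg):
    """Return (head_dim, query heads per key/value head, key/value width)."""
    head_dim = cfg["hidden_dim"] // cfg["num_query_heads"]
    queries_per_kv = cfg["num_query_heads"] // cfg["num_key_value_heads"]
    kv_dim = head_dim * cfg["num_key_value_heads"]
    return head_dim, queries_per_kv, kv_dim


def estimate_parameters(cfg, tied_embeddings=False):
    """Estimate the parameter count under the assumptions in the lead-in."""
    hidden = cfg["hidden_dim"]
    _, _, kv_dim = derived_shapes(cfg)

    # Attention: query and output projections are hidden x hidden,
    # key and value projections are hidden x kv_dim (grouped-query attention).
    attention = 2 * hidden * hidden + 2 * hidden * kv_dim
    # Gated feed-forward: gate, up, and down projections.
    feed_forward = 3 * hidden * cfg["intermediate_dim"]
    # Two normalization scale vectors per transformer block.
    norms = 2 * hidden
    per_layer = attention + feed_forward + norms

    embedding = cfg["vocabulary_size"] * hidden
    output_head = 0 if tied_embeddings else embedding
    final_norm = hidden
    return cfg["num_layers"] * per_layer + embedding + output_head + final_norm


head_dim, queries_per_kv, kv_dim = derived_shapes(config)
total = estimate_parameters(config)
print(head_dim, queries_per_kv, kv_dim)  # 128 4 1024
print(f"{total:,}")  # 8,030,261,248
```

Under these assumptions the estimate lands at roughly 8.03 billion parameters, consistent with the "8b" in the model name.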

	This model card has been generated automatically and should be completed by the model author. See [Model Cards documentation](https://huggingface.co/docs/hub/model-cards) for more information.