upstage/llama-30b-instruct-2048

@@ -18,30 +18,19 @@ pipeline_tag: text-generation
 ## Model Details
-### Model Developers
-- [Upstage](https://en.upstage.ai)
-### Backbone Model
-- [LLaMA](https://github.com/facebookresearch/llama/tree/llama_v1)
-### Variations
-- It has different model parameter sizes and sequence lengths: [30B/1024](https://huggingface.co/upstage/llama-30b-instruct), [30B/2048](https://huggingface.co/upstage/llama-30b-instruct-2048), [65B/1024](https://huggingface.co/upstage/llama-65b-instruct).
-### Input
-- Models solely process textual input.
-### Output
-- Models solely generate textual output.
-### License
-- This model is under a **Non-commercial** Bespoke License and governed by the Meta license. You should only use this repository if you have been granted access to the model by filling out [this form](https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform), but have either lost your copy of the weights or encountered issues converting them to the Transformers format.
-### Where to send comments
-- Instructions on how to provide feedback or comments on a model can be found by opening an issue in the [Hugging Face community's model repository](https://huggingface.co/upstage/llama-30b-instruct-2048/discussions).
 ## Dataset Details
 ### Used Datasets
 - [openbookqa](https://huggingface.co/datasets/openbookqa)
 - [sciq](https://huggingface.co/datasets/sciq)
 - [Open-Orca/OpenOrca](https://huggingface.co/datasets/Open-Orca/OpenOrca)
@@ -62,11 +51,8 @@ pipeline_tag: text-generation
 ## Hardware and Software
-### Hardware
-- We utilized an A100x8 for training our model.
-### Training Factors
-- We fine-tuned this model using a combination of the [DeepSpeed library](https://github.com/microsoft/DeepSpeed) and the [HuggingFace trainer](https://huggingface.co/docs/transformers/main_classes/trainer).
 ## Evaluation Results

 ## Model Details
+* **Developed by**: [Upstage](https://en.upstage.ai)
+* **Backbone Model**: [LLaMA](https://github.com/facebookresearch/llama/tree/llama_v1)
+* **Variations**: The model is available in different parameter sizes and sequence lengths: [30B/1024](https://huggingface.co/upstage/llama-30b-instruct), [30B/2048](https://huggingface.co/upstage/llama-30b-instruct-2048), [65B/1024](https://huggingface.co/upstage/llama-65b-instruct)
+* **Language(s)**: English
+* **Library**: [HuggingFace Transformers](https://github.com/huggingface/transformers)
+* **License**: This model is under a **Non-commercial** Bespoke License and governed by the Meta license. You should only use this repository if you have been granted access to the model by filling out [this form](https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform), but have either lost your copy of the weights or encountered issues converting them to the Transformers format
+* **Where to send comments**: To provide feedback or comments on the model, open an issue in the [Hugging Face community's model repository](https://huggingface.co/upstage/llama-30b-instruct-2048/discussions)
+* **Contact**: For questions and comments about the model, please email `contact@upstage.ai`
 ## Dataset Details
 ### Used Datasets
 - [openbookqa](https://huggingface.co/datasets/openbookqa)
 - [sciq](https://huggingface.co/datasets/sciq)
 - [Open-Orca/OpenOrca](https://huggingface.co/datasets/Open-Orca/OpenOrca)
 ## Hardware and Software
+* **Hardware**: We trained the model on 8 A100 GPUs (A100x8)
+* **Training Factors**: We fine-tuned this model using a combination of the [DeepSpeed library](https://github.com/microsoft/DeepSpeed) and the [HuggingFace trainer](https://huggingface.co/docs/transformers/main_classes/trainer)
 ## Evaluation Results