Update README.md
Browse files
README.md
CHANGED
@@ -37,6 +37,9 @@ SteerLM Paper: [SteerLM: Attribute Conditioned SFT as an (User-Steerable) Altern
|
|
37 |
|
38 |
Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
|
39 |
|
|
|
|
|
|
|
40 |
## Model Architecture:
|
41 |
|
42 |
**Architecture Type:** Transformer
|
@@ -51,8 +54,6 @@ The SteerLM method involves the following key steps:
|
|
51 |
|
52 |
Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
|
53 |
|
54 |
-
You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
|
55 |
-
|
56 |
## Software Integration:
|
57 |
|
58 |
**Runtime Engine(s):**
|
|
|
37 |
|
38 |
Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
|
39 |
|
40 |
+
You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
|
41 |
+
|
42 |
+
|
43 |
## Model Architecture:
|
44 |
|
45 |
**Architecture Type:** Transformer
|
|
|
54 |
|
55 |
Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
|
56 |
|
|
|
|
|
57 |
## Software Integration:
|
58 |
|
59 |
**Runtime Engine(s):**
|