Text Generation
NeMo
English
nvidia
steerlm
llama2
zhilinw commited on
Commit
03837fd
·
1 Parent(s): 7af111e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -37,6 +37,9 @@ SteerLM Paper: [SteerLM: Attribute Conditioned SFT as an (User-Steerable) Altern
37
 
38
  Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
39
 
 
 
 
40
  ## Model Architecture:
41
 
42
  **Architecture Type:** Transformer
@@ -51,8 +54,6 @@ The SteerLM method involves the following key steps:
51
 
52
  Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
53
 
54
- You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
55
-
56
  ## Software Integration:
57
 
58
  **Runtime Engine(s):**
 
37
 
38
  Llama2-70B-SteerLM-Chat is trained with NVIDIA NeMo, an end-to-end, cloud-native framework to build, customize, and deploy generative AI models anywhere. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pretrained models, offering enterprises an easy, cost-effective, and fast way to adopt generative AI.
39
 
40
+ You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Aligner) following [SteerLM training user guide](https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/steerlm.html) or run inference based on steps below.
41
+
42
+
43
  ## Model Architecture:
44
 
45
  **Architecture Type:** Transformer
 
54
 
55
  Llama2-70B-SteerLM-Chat applies this technique on top of the Llama 2 70B Foundational model architecture. It was pretrained on internet-scale data and then aligned using [Open Assistant](https://huggingface.co/datasets/OpenAssistant/oasst1) and [HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer).
56
 
 
 
57
  ## Software Integration:
58
 
59
  **Runtime Engine(s):**