Update README.md
README.md

````diff
@@ -94,7 +94,8 @@ print(tokenizer.decode(response, skip_special_tokens=True))
 ## Inference Server Hosting Example
 ```bash
 pip install vllm
-vllm serve scb10x/llama3.1-typhoon2-70b-instruct
+vllm serve scb10x/llama3.1-typhoon2-70b-instruct --tensor-parallel-size 2
+# using at least 2 80GB gpu for hosting 70b model
 # see more information at https://docs.vllm.ai/
 ```
````
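Once `vllm serve` is up, it exposes an OpenAI-compatible HTTP API. A minimal sketch of a chat-completions request, assuming the server runs locally on vLLM's default port 8000 (the prompt text is illustrative):

```shell
# Query the hosted model through vLLM's OpenAI-compatible endpoint.
# Assumption: the serve command above is running locally on the default port 8000.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "scb10x/llama3.1-typhoon2-70b-instruct",
        "messages": [{"role": "user", "content": "Hello!"}]
      }'
```

The `model` field must match the checkpoint name passed to `vllm serve`.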