davidshtian/Mistral-7B-Instruct-v0.2-neuron-1x2048-2-cores-2.18

Text Generation

davidshtian committed on Apr 2

Commit ee0bcdd, 1 parent: d880cf5

Update README.md

Files changed (1)

README.md +16 -0

@@ -18,6 +18,22 @@ Please refer to the 🤗 `optimum-neuron` [documentation](https://huggingface.co
 Note: To compile mistralai/Mistral-7B-Instruct-v0.2 on Inf2, set `sliding_window` in the model config (either in the config file or on the loaded model's config) from `null` to the default value of 4096.
+## Usage with 🤗 `TGI`
+```shell
+# Replace hf_xxx with your own Hugging Face access token.
+export HF_TOKEN="hf_xxx"
+docker run -d -p 8080:80 \
+       -v "$(pwd)/data:/data" \
+       --device=/dev/neuron0 \
+       -e HF_TOKEN="${HF_TOKEN}" \
+       public.ecr.aws/shtian/neuronx-tgi:latest \
+       --model-id davidshtian/Mistral-7B-Instruct-v0.2-neuron-1x2048-2-cores-2.18 \
+       --max-batch-size 1 \
+       --max-input-length 16 \
+       --max-total-tokens 32
+```
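Once the container is running, the server can be queried over HTTP. The following is a minimal sketch, assuming the `-p 8080:80` port mapping from the command above and TGI's standard `/generate` route. `TGI_URL`, `build_payload`, and `generate` are illustrative names and are not part of the original README. With `--max-input-length 16` and `--max-total-tokens 32`, the prompt must fit in 16 tokens and the prompt plus generated tokens in 32, so the example requests at most 16 new tokens.

```shell
# Assumption: host port 8080 is mapped to the container's port 80 (see docker run above).
TGI_URL="${TGI_URL:-http://localhost:8080}"

# Build the JSON body for the /generate route.
# $1: prompt (plain text; this sketch does no JSON escaping)
# $2: max_new_tokens
build_payload() {
  printf '{"inputs":"%s","parameters":{"max_new_tokens":%d}}' "$1" "$2"
}

# Send a prompt to the server and print the raw JSON response.
# $1: prompt, $2: max_new_tokens (defaults to 16)
generate() {
  curl -s "${TGI_URL}/generate" \
       -X POST \
       -H 'Content-Type: application/json' \
       -d "$(build_payload "$1" "${2:-16}")"
}

# Example (requires the container to be up and the model loaded):
# generate "What is AWS Inferentia?" 16
```

The functions are only defined here, so sourcing the snippet has no side effects. Call `generate` once the server has finished loading the model.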
 ## Usage with 🤗 `optimum-neuron`
 ```python