second-state
/

Qwen1.5-110B-Chat-GGUF

Text Generation

Model card Files Files and versions Community

apepkuss79 commited on May 26

Commit

6b1cec5

•

1 Parent(s): 859e9a6

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -56,7 +56,7 @@ tags:
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen1.5-110B-Chat-Q2_K.gguf \
     llama-api-server.wasm \
     --prompt-template chatml \
-    --ctx-size 8192 \
     --model-name qwen1.5-110b-chat
   ```
@@ -65,7 +65,8 @@ tags:
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen1.5-110B-Chat-Q2_K.gguf \
     llama-chat.wasm \
-    --prompt-template chatml
   ```
 ## Quantized GGUF Models

   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen1.5-110B-Chat-Q2_K.gguf \
     llama-api-server.wasm \
     --prompt-template chatml \
+    --ctx-size 32000 \
     --model-name qwen1.5-110b-chat
   ```
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen1.5-110B-Chat-Q2_K.gguf \
     llama-chat.wasm \
+    --prompt-template chatml \
+    --ctx-size 32000
   ```
 ## Quantized GGUF Models