When I run the command, it did not work (via vLLM 0.7.3)
#16
opened by xueshuai
I failed to see where it didn't work, maybe try providing more logs and describing things properly?
NCCL_P2P_DISABLE=1 VLLM_WORKER_MULTIPROC_METHOD=spawn python -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 12345 --max-model-len 65536 --max-num-batched-tokens 65536 --trust-remote-code --tensor-parallel-size 8 --gpu-memory-utilization 0.97 --dtype float16 --served-model-name deepseek-reasoner --model cognitivecomputations/DeepSeek-R1-AWQ
It worked.
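For anyone hitting the same issue, here is a minimal sketch to check that the server is responding once it starts. It assumes the host/port and served model name from the command above; the "EMPTY" API key is a placeholder, not something from the original post.

```python
# Minimal check against the OpenAI-compatible endpoint started above.
# Assumes host 0.0.0.0, port 12345, and served-model-name deepseek-reasoner.
from openai import OpenAI

client = OpenAI(base_url="http://0.0.0.0:12345/v1", api_key="EMPTY")  # placeholder key

response = client.chat.completions.create(
    model="deepseek-reasoner",  # must match --served-model-name
    messages=[{"role": "user", "content": "Hello"}],
    max_tokens=32,
)
print(response.choices[0].message.content)
```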
xueshuai changed discussion status to closed