Qwen2-VL-72B-Instruct-GPTQ-Int8 / model-00010-of-00021.safetensors

可亲

fix(pad zero): pad intermediate_size to 29696 so that the quantized model can run with 8-way tensor parallelism in vLLM
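
The arithmetic behind the padded value in this commit message can be sketched as follows. Two inputs are assumptions not stated on this page: an original intermediate_size of 29568 (the value published in the Qwen2-72B configuration) and a GPTQ quantization group size of 128. The helper name pad_intermediate_size is hypothetical and is not part of vLLM or any quantization library. Under these assumptions, each of the 8 tensor-parallel shards must hold a whole number of quantization groups, so the size is rounded up to a multiple of 8 × 128.

```python
def pad_intermediate_size(size: int, tp: int, group_size: int) -> int:
    """Round `size` up to the nearest multiple of tp * group_size.

    Hypothetical helper for illustration only.
    """
    unit = tp * group_size
    # Ceiling division using integers only, then scale back up.
    return -(-size // unit) * unit


# Assumed values (not stated in the source page).
ORIGINAL_SIZE = 29568   # assumed unpadded intermediate_size
TP = 8                  # tensor-parallel degree from the commit message
GROUP_SIZE = 128        # assumed GPTQ group size

# Per-shard width before padding, and the remainder against the group size.
shard_before = ORIGINAL_SIZE // TP
remainder_before = shard_before % GROUP_SIZE

padded = pad_intermediate_size(ORIGINAL_SIZE, TP, GROUP_SIZE)
shard_after = padded // TP
remainder_after = shard_after % GROUP_SIZE

print(shard_before, remainder_before)
print(padded, shard_after, remainder_after)
```

With the assumed inputs, the unpadded per-shard width is not a multiple of the group size, while the padded width is; the padded total matches the 29696 named in the commit message.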

d1eab90 4 months ago

3.93 GB

This file is stored with Git LFS. It is too big to display, but you can still download it.

Git Large File Storage (LFS) replaces large files with text pointers inside Git, while storing the file contents on a remote server.
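
To illustrate what such a text pointer looks like, here is a minimal sketch that parses one. The pointer layout (a version line, an oid line, and a size line) follows the Git LFS pointer format; the oid and size values below are placeholders, not the real values for this file, and parse_lfs_pointer is a hypothetical helper rather than part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file.

    Hypothetical helper for illustration only.
    """
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


# Placeholder pointer: the digest and size are made up for this example.
EXAMPLE_POINTER = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:" + "0" * 64 + "\n"
    "size 12345\n"
)

info = parse_lfs_pointer(EXAMPLE_POINTER)
print(info["hash_algo"], info["size"])
```

The pointer itself is only a few lines long, which is why the repository can track it in Git while the multi-gigabyte weights live on the remote LFS server.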