Qwen2-VL-72B-Instruct-GPTQ-Int8 / model-00010-of-00021.safetensors
可亲
fix(pad zero) pad intermediate_size to 29696 to make sure quantized model can use 8 tensor-parallel in vllm
d1eab90
This file is stored with Git LFS . It is too big to display, but you can still download it.

Git LFS Details

  • SHA256: 081e6a9f411358b7a91b3444f4ae49534d884232e05e6d9b420d6339e3d25662
  • Pointer size: 135 Bytes
  • Size of remote file: 3.93 GB

Git Large File Storage (LFS) replaces large files with text pointers inside Git, while storing the file contents on a remote server. More info.