stan-hua's picture
AWQ model for Qwen/Qwen2-7B-Instruct: {'w_bit': 4, 'zero_point': True, 'q_group_size': 128, 'version': 'GEMM'}
45dd1c1 verified