Qwen/Qwen-7B-Chat-Int8

Qwen-7B-Chat-Int8 / modeling_qwen.py

Commit History

update modeling_qwen.py

c6b0a09

yangapku committed on Dec 7, 2023

update modeling_qwen.py

de89198

yangapku committed on Dec 6, 2023

update modeling_qwen.py

c04bccd

yangapku committed on Dec 4, 2023

update modeling_qwen.py

75cd2af

yangapku committed on Dec 3, 2023

update

01803f3

yangapku committed on Nov 30, 2023

remove fixed-size causal mask

dcef457

yangapku committed on Nov 14, 2023

add kernel file check in modeling_qwen.py

c94803d

yangapku committed on Nov 5, 2023

update modeling.py

24ac14a

yangapku committed on Oct 26, 2023

update modeling_qwen.py

502a463

yangapku committed on Oct 16, 2023

update batch inference

1241954

yangapku committed on Oct 14, 2023

upload model

ce1512e

yangapku committed on Oct 11, 2023