Qwen-7B-Chat-Int8 / modeling_qwen.py

Commit History

update modeling_qwen.py
c6b0a09

yangapku committed on

update modeling_qwen.py
de89198

yangapku committed on

update modeling_qwen.py
c04bccd

yangapku committed on

update modeling_qwen.py
75cd2af

yangapku committed on

update
01803f3

yangapku committed on

remove fix-sized causal mask
dcef457

yangapku committed on

add kernel file check in modeling_qwen.py
c94803d

yangapku committed on

update modeling.py
24ac14a

yangapku committed on

update modeling_qwen.py
502a463

yangapku committed on

update batch inference
1241954

yangapku committed on

upload model
ce1512e

yangapku committed on