mpt-7b-8k-chat-awq / config.json

Commit History

Set initial device to cuda:0 for faster initial loading
364bcfc

casperhansen committed on

MPT 7B 8K quantized
5c660fe

casperhansen committed on