jondurbin's picture
Update README.md
c61e213
|
raw
history blame
216 Bytes
metadata
license: apache-2.0

Slightly modified mpt-30b, which has some updates to allow gradient checkpointing/etc., to be compatible with qlora training code.

Original model: https://huggingface.co/mosaicml/mpt-30b