how to fine tune this model? example code? Qlora? flash attention?
on their github I think there's more info about fine-tuninghttps://github.com/BlinkDL/RWKV-LM
· Sign up or log in to comment