RuntimeError: FlashAttention only supports Ampere GPUs or newer.

#6
by NeuralFalcon - opened
This comment has been hidden
NousResearch org

Remove the use_flash_attention_2=True line

teknium changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment