Before: If a user runs the code with the float32 setting and flash_attention enabled, they hit an undefined variable error caused by a misspelling of self.query.
After: If a user runs the code with the float32 setting and flash_attention enabled, they correctly get a "float32 not supported" error.
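
To illustrate the difference, here is a minimal sketch of the failure mode, not the actual model code. The class names, the `check_dtype` method, the `qeury` typo, and the error message are all hypothetical. The real misspelling and the exact exception type are not stated in this PR. Only `self.query`, float32, and flash_attention come from the description above.

```python
class Query:
    """Stand-in for a projection layer; it only carries a dtype name."""

    def __init__(self, dtype: str) -> None:
        self.dtype = dtype


class BuggyAttention:
    """Hypothetical 'before' state: the dtype check references a misspelled attribute."""

    def __init__(self, dtype: str, use_flash_attention: bool) -> None:
        self.query = Query(dtype)
        self.use_flash_attention = use_flash_attention

    def check_dtype(self) -> None:
        # Assumed typo: `qeury` instead of `query`. The attribute lookup fails
        # before the dtype is ever compared, so the intended error is never raised.
        if self.use_flash_attention and self.qeury.dtype == "float32":
            raise ValueError("flash_attention does not support float32")


class FixedAttention(BuggyAttention):
    """Hypothetical 'after' state: the check reads self.query as intended."""

    def check_dtype(self) -> None:
        if self.use_flash_attention and self.query.dtype == "float32":
            raise ValueError("flash_attention does not support float32")


def error_for(attention: BuggyAttention) -> str:
    """Return the name of the exception raised by check_dtype, or 'none'."""
    try:
        attention.check_dtype()
    except Exception as exc:
        return type(exc).__name__
    return "none"


if __name__ == "__main__":
    print("before:", error_for(BuggyAttention("float32", True)))
    print("after: ", error_for(FixedAttention("float32", True)))
```

In this sketch the buggy version only fails when flash_attention is enabled, because the `and` short-circuits before the misspelled attribute is read. That would match the description: the problem appears only for the float32 plus flash_attention combination.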

sdadas changed pull request status to merged
