flash_attn requirement prevents loading on macos

#6
by bghira - opened

Running a 128GB unified arch M3 Max and I cannot load the pipeline due to flash_attn not working on Apple MPS.

Will be updating the model shortly with the official gemma bugfixes - and lets see if that works.

shortly? :)

Sign up or log in to comment