flash attention

#21
by Disassemblern - opened

Is there any way to use this model for vector embedding without requiring flash attention library. Because my gpu vm is not compatible with flash attention.

Sign up or log in to comment