feat: added functionality to cleave off layers from BERT encoder 86b0438 Markus28 commited on Mar 15, 2024
feat: choose flash attention heuristically if not set explicitly 2e2b8d0 Markus28 commited on Mar 6, 2024