Set use_cache to True, otherwise inference performance is poor 5e8c41a TheBloke commited on May 31, 2023