Set use_cache to True, otherwise inference performance is poor (#2) 64c10ed winglian TheBloke commited on Jun 12, 2023