OOM when quantizing for 32k context length
#3 opened about 1 year ago
by
harshilp
Code is looking for 'modeling_flash_llama.py' on huggingface even though I have it in local folder
#2 opened over 1 year ago
by
alexrider
Fine tuning this model further
1
#1 opened over 1 year ago
by
sdranju