RuntimeError: probability tensor contains either inf, nan or element < 0

#21

by FrankWu - opened Sep 5, 2023

Sep 5, 2023

I meet this issue "RuntimeError: probability tensor contains either inf, nan or element < 0"，my gpu is V100. Is there anyone no why?

iCSawyer

Sep 5, 2023

I ran into this problem with 7B model. I have solved it by using bf16 precision and replacing model.half() with model.bfloat16(). Maybe you can try this.

FrankWu

Sep 5, 2023

aha，V100 doesn't support bf16. It is said that fp16 usually is ok to do inference.But i also found

It seems that codellama is pretrained with bf16

iCSawyer

Sep 5, 2023

•

edited Sep 5, 2023

haha, I found this solution

I ran into this problem with 7B model. I have solved it by using bf16 precision and replacing model.half() with model.bfloat16(). Maybe you can try this.

using llama keyword which is the foundation model 🤣

maybe you can try fp32 or 8/4bit

FrankWu

Sep 6, 2023

set load_8_bit or load_4_bit =True, it's ok. But set torch_dtype=torch.float32 is still NG. It's very strange

FrankWu

Sep 6, 2023

Maybe this is just Wizardcode issue, i try another codellama version ok

FrankWu changed discussion status to closed Sep 6, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment