Text Generation
Transformers
Safetensors
English
llama
meta
llama-3
conversational
text-generation-inference

Rope Theta Value Difference?

#24
by fahadh4ilyas - opened

The value of your rope theta for 8B is slightly different then what you have written in the model card. It seems that you wrote the rope theta for 70B in this model's description. Could you please write the real value for 8B? Or is the difference negligible?

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment