meta-llama/Meta-Llama-3-70B-Instruct

Update config.json3c4fd3a4

ArthurZ

Meta Llama org Apr 25, 2024

No description provided.

oldmanhuggingface

May 10, 2024

Is this supposed to be merged or not?

ArthurZ

Meta Llama org May 10, 2024

I think we should only use 128009 as the eos

ArthurZ changed pull request status to closed May 10, 2024

oldmanhuggingface

May 10, 2024

So were you guys gonna make these updates or what? I see you merged the pull request for llama-3-8b-instruct tokenizer_config.json, but you're saying config.json should be changed so the eos is only 128009? Do I have that right? So we still need updates for config.json on both instruct models and tokenizer_config.json on 70b-instruct? And what about the base models? Why aren't you guys taking this more seriously? Again, this isn't about me or the ease with which I can make these changes. It's about publishing complete versions of your billion-dollar project that the community can use with confidence. Also, why don't you guys know the correct parameters for your model? I'm genuinely confused about all of this

epignatelli

May 18, 2024

I think we should only use 128009 as the eos

It would be great to give the reasons behind this, especially if that rejects a PR that fixes an issue spread to all the user base.
Why not adding 128009 as termination criterion?