Additional model suggestions

#4
by MoonRide - opened

@Goekdeniz-Guelmez There are 3 more recent Qwen models that might be good base for your J&A process:

  1. https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct (it was better generalist than normal Qwen 14B in my benchmark).
  2. https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-1M (for folks who want to play with bigger context sizes).
  3. https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M (as above).

Sign up or log in to comment