Additional model suggestions
#4 opened by MoonRide
@Goekdeniz-Guelmez There are 3 more recent Qwen models that might be a good base for your J&A process:
- https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct (it was a better generalist than the normal Qwen 14B in my benchmark).
- https://huggingface.co/Qwen/Qwen2.5-7B-Instruct-1M (for folks who want to play with bigger context sizes).
- https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M (as above).