I tried to test this model with llama.cpp and the quants from "DevQuasar/VAGOsolutions.SauerkrautLM-v2-14b-DPO-GGUF" and it only output random timestamps.
I tried with and without flash_attention, both times similiar result.
· Sign up or log in to comment