A test version, trying to align Qwen2.5-Coder-1.5B to make it output the thinking process.