Some questions about post-training (function calling)

#4
by postitive666 - opened

Hello! I use this model in the second stage of function calling, and the model alters the image URL returned by my function call. Is this due to post-training? Any suggestions?

Owner

Can you try this with the base Qwen models and see whether it happens there as well?
Please also provide the prompt so I can reproduce it.

Hello, the original Qwen instruct model is relatively more stable here. This is an example of a hallucination I encountered when the model called the image generation tool:
{
"generate_image": {
"prompt": "a cat",
"toolbench_rapidapi_key": ""   <- generated by the model in error
}
}
My interface should only take a single "prompt" parameter, but the model adds a new "toolbench_rapidapi_key" parameter on its own. I have tried some simple fixes, such as adjusting the prompt or applying rule-based matching to catch the recurring errors, and I can handle issues like that. The harder case is when my tool returns a URL such as 'geni.static/xxx.png' and I pass it back to the model for summarization: the model may alter the URL, for example changing it to 'https://geni/static' or '/geni.xxxstaice/xx.png'.
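For anyone hitting the same two symptoms, here is a minimal Python sketch of one possible workaround. It is not something the model or any library provides. It assumes the tool schema is known in advance (the `ALLOWED_ARGS` table is hypothetical), and it swaps each exact URL for an opaque placeholder before the summarization turn so the model never has a chance to rewrite it. The helper names and the `<<URL_n>>` token format are made up for illustration.

```python
# Hypothetical schema: the only arguments each tool is allowed to receive.
ALLOWED_ARGS = {"generate_image": {"prompt"}}


def filter_tool_args(call: dict) -> dict:
    """Drop any argument that the tool schema does not declare."""
    cleaned = {}
    for tool, args in call.items():
        allowed = ALLOWED_ARGS.get(tool, set())
        cleaned[tool] = {k: v for k, v in args.items() if k in allowed}
    return cleaned


def mask_urls(text: str, urls: list[str]) -> tuple[str, dict[str, str]]:
    """Replace each exact URL with a placeholder token before the model sees it."""
    mapping = {}
    # Longest first, so a URL that is a substring of another is not masked early.
    for i, url in enumerate(sorted(urls, key=len, reverse=True)):
        token = f"<<URL_{i}>>"
        mapping[token] = url
        text = text.replace(url, token)
    return text, mapping


def unmask_urls(text: str, mapping: dict[str, str]) -> str:
    """Put the original URLs back into the model's answer."""
    for token, url in mapping.items():
        text = text.replace(token, url)
    return text


def missing_urls(answer: str, urls: list[str]) -> list[str]:
    """Return the URLs that do not appear verbatim in the final answer."""
    return [u for u in urls if u not in answer]
```

Usage would be: run `filter_tool_args` on the parsed tool call before executing it, run `mask_urls` on the tool result before the second turn, then run `unmask_urls` on the model's summary and check `missing_urls` to decide whether to retry. The model can still drop or mangle a placeholder, so the final check is worth keeping.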

postitive666 changed discussion status to closed
postitive666 changed discussion status to open
Owner

I don't see the prompt itself or the model's answer.
Please provide:

  • Hyperparameters (temp, top_p, top_k)
  • System Prompt
  • Input Prompt

for both the base model and the Cybertron model. Thanks.

Sorry for the late reply. I did some comparisons and found that the main issue is not with this model. Some hallucinations may occur when calling the drawing API, but they do not affect usage. Qwen may also make recognition errors. Do you have experience with multi-turn tool calls? Can you provide some guidance?

Owner

Unfortunately not, I haven't used this feature of LLMs. I've been busy with other stuff. But I'll leave the thread open in case someone else can help you.
