So is Gemma 3 not capable of multimodal inference, or is there a different way to prompt the model? I am having the same issue and am still figuring out how to solve it.
Tommaso Tubaldo
tommiTub
Recent Activity
commented on an article 28 days ago
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM