Running on Zero 62 62 VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions