Generate text responses using images and text prompts
Reward-based Noise Optimization for 1-step t2i models
A VLM-based message decoder that is trained via GRPO