Generate clickable coordinates on a screenshot
Generate images from text prompts
Localizing moments in videos via text queries
Turn video uploads into real-time narration and questions