Create images from pose-guided prompts
Generate 3D room layouts from RGB panoramas
Describe images using multiple models