Generate clusters and visualizations from images
Process video to analyze human visual motion
Transcribe audio or YouTube videos