4M Demo
β‘
203
4M: Massively Multimodal Masked Modeling
4M: Massively Multimodal Masked Modeling
Mix two images with a slider
Generate custom images with LoRAβenhanced Stable Diffusion
Generate captions for images
Generate passportβready ID photos from a portrait
Train Free PersonalizΒ° Diff w/ Stochastic Optimal Control
Segment and caption objects in images and videos
Generate images from Japanese text prompts