In-browser unified multimodal understanding and generation.
A simple app for doing HTR with various models.