UGround
Extract text from images using various OCR modes
Track points in a video
Describe image contents with prompts
Answer questions about uploaded videos or images
Easily convert PDF and Office docs into Markdown and JSON
Visual Retrieval with ColPali and Vespa
Generate clickable coordinates on a screenshot
Demo for https://github.com/Byaidu/PDFMathTranslate
Controlling Computers with Small Models
Generate full app code from a simple description