---
title: Handwriting OCR Data Collection
emoji: ✍️
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 5.15.0
app_file: app.py
pinned: false
short_description: Collect handwritten text samples for OCR training
tags:
  - ocr
  - handwriting
  - dataset
  - computer-vision
hf_oauth: true
hf_oauth_expiration_minutes: 480
hf_oauth_scopes:
  - read-repos
  - write-repos
  - manage-repos
  - inference-api
---

# Handwriting OCR Dataset Collection

This Space provides an interface for collecting handwritten samples of text to create a dataset for OCR (Optical Character Recognition) training. Users are presented with text snippets which they can handwrite and upload as images.

## How it Works

1. You will be shown 1-5 consecutive sentences about OCR and handwriting recognition
2. Write these sentences by hand on paper
3. Take a photo or scan of your handwriting
4. Upload the image through the interface
5. Submit or skip to get a new text block

The collected data pairs (text and corresponding handwritten images) will be used to train and improve handwriting recognition models.

## Usage

Simply visit the Space and follow the on-screen instructions to contribute your handwriting samples to the dataset.
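
For readers curious how a text block of 1-5 consecutive sentences might be chosen, here is a minimal sketch. It is an illustration only, not code taken from `app.py`: the function name `pick_text_block`, the `max_len` parameter, and the sample sentences are all hypothetical.

```python
import random


def pick_text_block(sentences, rng=None, max_len=5):
    """Return between 1 and max_len consecutive sentences (hypothetical helper)."""
    if not sentences:
        raise ValueError("sentences must not be empty")
    rng = rng or random.Random()
    # Block length is capped by both max_len and the number of sentences available.
    length = rng.randint(1, min(max_len, len(sentences)))
    # Choose a start index so the whole block fits inside the list.
    start = rng.randint(0, len(sentences) - length)
    return sentences[start:start + length]


if __name__ == "__main__":
    # Placeholder corpus; the sentences actually shown by the Space are not listed here.
    corpus = [
        "OCR converts images of text into machine-readable characters.",
        "Handwriting varies widely between writers.",
        "Scanned pages often contain noise and skew.",
        "Training data pairs each image with its transcription.",
        "Cursive writing is harder to segment than print.",
        "Good lighting improves photo quality.",
    ]
    block = pick_text_block(corpus)
    print(" ".join(block))
```

Passing a seeded `random.Random` instance as `rng` makes the selection reproducible, which can help when checking that a given image was paired with the right text.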