Mixed-language images: keeping OCR language settings honest
Multilingual UI OCR shows up in global product screenshots, airport signage photos, menus, and packaging where Latin letters sit next to Chinese, Japanese, or Korean. Picking the wrong primary language can corrupt an entire paragraph, so on `multilingual-ocr-browser` split the canvas: crop the button row separately from legal copy, OCR each crop with the right language hint, then merge the transcript while preserving punctuation. Vertical display type and stylized logos may never OCR perfectly—plan for manual touch-up. Redact account numbers before sharing samples in public channels, and log which language profile you used so the next reviewer can reproduce the same run.
Multilingual OCR checklist (`multilingual-ocr-browser`)
- Open `multilingual-ocr-browser`, upload the mixed-language capture, outline primary versus secondary text blocks, and crop them if they need different language settings.
- Run OCR per block, stitch the narrative in order, and fix spacing issues caused by script changes; verify proper nouns against the source image.
- Publish the merged transcript with a short note listing languages and crops, and archive the original image for translators or compliance reviewers.