Hacker News with Generative AI: OCR

Llama-OCR: Document to Markdown (llamaocr.com)
Upload an image to turn it into structured markdown
Show HN: I launched a super cheap and simple to use OCR tool for macOS (textcapture.app)
Ever tried to quickly copy and paste some text, only to realize it's unselectable or embedded in a video or image? That happens to me all the time!
Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o (github.com/yigitkonur)
Swift OCR: LLM Powered Fast OCR ⚡
General OCR Theory: Towards OCR-2.0 via a Unified End-to-End Model (huggingface.co)
Traditional OCR systems (OCR-1.0) are increasingly unable to meet users' needs, given the growing demand for intelligent processing of man-made optical characters.
Show HN: LLM-aided OCR – Correcting Tesseract OCR errors with LLMs (github.com/Dicklesworthstone)
Show HN: Zerox – Document OCR with GPT-mini (github.com/getomni-ai)
Ask HN: How to OCR a PDF and preserve whitespace? (ycombinator.com)
OCR Tools for Mac, iOS and Windows (rorybowcott.com)
Extracting Words from Scanned Books: A Step-by-Step Tutorial with Python, OpenCV (github.com/feitgemel)