Hacker News with Generative AI: PDF Parsing

Parsing PDFs (and more) in Elixir using Rust (chriis.dev)
Here's the thing about PDFs - they're complex beasts that require quite a bit of thinking to properly parse - they come in all shapes and sizes, and they can contain a lot of different types of data and formatting. 90% of the time, we just want to extract the text from the file, but that's not always easy - for the remaining 10%, well we won't be covering that in this blog post.
Ask HN: What are you using to parse PDFs for RAG? (ycombinator.com)