Hacker News with Generative AI: PDF

Auntie PDF – an open source app built using Mistral OCR (auntiepdf.com)
Your all-knowing guide that unpacks every PDF into clear, actionable insights. Just like your favorite aunt, but for documents!
Tips for using Gemini 2.0 for PDF ingestion (sergey.fyi)
Ask HN: Where are the good Markdown to PDF tools (that meet these requirements)? (ycombinator.com)
I'm trying to convert a very large Markdown file (a couple hundred pages) to PDF.
What can be computed? A practical guide to the theory of computation (2018) [pdf] (softouch.on.ca)
OlmOCR: Open-source tool to extract plain text from PDFs (allenai.org)
Show HN: I built a free and open source pdf redaction tool (magicredact.com)
Need to hide confidential data from PDFs or images? Our free redaction tool makes it easy. Just upload your file, and we'll automatically detect and remove sensitive text. No manual work required.
iText PDF Library turns 25 (itextpdf.com)
On 14th February 2000 the first public version of the iText PDF library was released to the open-source community. A quarter of a century later, Apryse is proudly celebrating with the release of iText Suite 9.1 on iText’s 25th anniversary, which is also Valentine’s Day! This release brings significantly expanded SVG and CSS support, huge performance increases, GraalVM for pdfHTML, and a whole lot of love!
Tea Extensions [pdf] (2006) (tayloredge.com)
Show HN: HTML visualization of a PDF file's internal structure (github.com/desgeeko)
Inspecting the internal structure of a PDF file involves a lot of things (decompression, parsing, xref indexing, etc...) in order to make sense of the raw bytes.
Linux Running in a PDF (doompdf.dev)
Linux running inside a PDF file via a JavaScript-compiled RISC-V emulator (xda-developers.com)
Mutool – all purpose tool for dealing with PDF files (mankier.com)
mutool is a tool based on MuPDF for dealing with document files in various manners. There are several sub commands available, as described below.
Linux running inside a PDF file via a RISC-V emulator (github.com/ading2210)
This is Linux running inside a PDF file via a RISC-V emulator, which is based on TinyEMU.
Show HN: Tetris in a PDF (th0mas.nl)
Tdf: Terminal-Based PDF Viewer (github.com/itsjunetime)
A terminal-based PDF viewer.
Show HN: CxReports – Low-Code Tool for User-Facing PDF Reports (ycombinator.com)
Marko here from Codaxy. For over two years, we have been working on CxReports, a low-code tool for creating user-facing PDF documents and reports.
Copy, Paste, Invert, Forget (2011) [pdf] (atelier-hirschbichler.com)
Converting untrusted PDFs into trusted ones: The Qubes Way (2013) (invisiblethings.org)
Arguably one of the biggest challenges for desktop security is how to handle those overly complex PDFs, DOCs, and similar files, that are so often exchanged by people, or downloaded from the Web, and that often provide a way for the attacker to compromise the user's desktop system.
Show HN: I built a simple app to create PDF invoice (invoicepdf.dev)
Great tool for freelancers, small businesses, and anyone who needs to create professional invoices.
Using Pandoc and Typst to Produce PDFs (imaginarytext.ca)
I recently responded to someone on Mastodon who asked about producing decent-looking PDFs from markdown. I replied eagerly with “OMG Typst!” and linked to my earlier blogpost about developing an entire book layout template for Pandoc and Typst. The response I then received was this was “far beyond” what they need – and on reflection I had to admit that my blog post was a bit, well, niche.
Windows 11 Security Book [pdf] (microsoft.com)
Show HN: CLI to export Markdown to PDF using Jinja2 templates (github.com/andy-verstraeten)
Mdexport is CLI tool to publish Markdown files as PDF using Jinja2 templates. You can use Frontmatter metadata as custom values to be filled into your template.
PgPDF: Pdf Type and Functions for Postgres (github.com/Florents-Tselai)
This extension for PostgreSQL provides a pdf data type.
Show HN: Kis.tools – A directory of tools that work (kis.tools)
PDFgear is a breath of fresh air in the PDF tools landscape. It offers advanced features completely free while respecting user privacy with local processing in desktop apps. It's wonderful to find a comprehensive PDF tool that keeps things simple and genuinely free - we hope it stays this way for a long time!
Tell HN: macOS Sequoia Preview app adds random passwords to PDFs upon Saving (ycombinator.com)
This bug was originally fixed but it's back again: https://apple.stackexchange.com/questions/436596/preview-automatically-password-protects-pdf-when-saving
Chunkr – Vision model based PDF chunking (github.com/lumina-ai-inc)
We're Lumina. We've built a search engine that's 5x more relevant than Google Scholar. You can check us out at lumina.sh. We achieved this by bringing state of the art search technology (the best in dense and sparse vector embeddings) to academic research.
The Origins of PostScript [pdf] (gwern.net)
Show HN: A pretty decent PDF to CSV converter (mightymerge.io)
Convert PDF to CSV online for free
Toyota Maru (1990) [pdf] (ayuba.fr)
Show HN: IPA, a GUI for exploring inner details of PDFs (github.com/seekbytes)