Hacker News with Generative AI: PostgreSQL

The Internals of PostgreSQL (interdb.jp)
PostgreSQL is a well-designed, open-source multi-purpose relational database system which is widely used throughout the world.
BM25 in PostgreSQL (vectorchord.ai)
We’re excited to share something special with you: VectorChord-BM25, a new extension designed to make PostgreSQL’s full-text search even better. Whether you’re building a small app or managing a large-scale system, this tool brings advanced BM25 scoring and ranking right into PostgreSQL, making your searches smarter and faster.
Vector Search at 10k QPS in PostgreSQL (vectorchord.ai)
VectorChord is a PostgreSQL extension designed for scalable, high-performance, and disk-efficient vector similarity search, and serves as the successor to pgvecto.rs. In our previous blog post, we showed that with just $250 per month, VectorChord achieved 131 QPS with 0.95 precision on 100 million vectors—demonstrating impressive cost-effective performance for large-scale vector search.
VectorChord-BM25: PostgreSQL Search with BM25 – 3x Faster Than Elasticsearch (vectorchord.ai)
We’re excited to share something special with you: VectorChord-BM25, a new extension designed to make PostgreSQL’s full-text search even better. Whether you’re building a small app or managing a large-scale system, this tool brings advanced BM25 scoring and ranking right into PostgreSQL, making your searches smarter and faster.
Critical PostgreSQL bug tied to zero-day attack on US Treasury (theregister.com)
A high-severity SQL injection bug in the PostgreSQL interactive tool was exploited alongside the zero-day used to break into the US Treasury in December, researchers say.
Reliably replicating data between Postgres and ClickHouse (benjaminwootton.com)
In this series of articles we are going to demonstrate how to reliably replicate data from PostgreSQL to ClickHouse.
PostgreSQL Lands Self-Join Elimination Optimization (phoronix.com)
More than seven years in the making, merged yesterday for PostgreSQL is a self-join elimination "SJE" feature as a performance optimization for some queries.
Representing Graphs in PostgreSQL (richard-towers.com)
Let’s say we’ve got some graph-like data, such as a social network.
PgAssistant: OSS tool to help devs understand and optimize PG performance (github.com/nexsol-technologies)
PgAssistant is an open-source tool designed to help developers understand and optimize their PostgreSQL database performance.
Microsoft open sources PostgreSQL extensions (theregister.com)
Analysis When Microsoft rolled out an open source extension stack for PostgreSQL to handle document-style data, it wasn't just taking aim at MongoDB – the dominant NoSQL player – but also blurring the lines between relational and non-relational databases, according to one expert.
PostgreSQL Best Practices (speakdatascience.com)
PostgreSQL (Postgres) is one of the most powerful and popular relational database management systems available today. Whether you’re a database administrator, developer, or DevOps engineer, following best practices ensures optimal performance, security, and maintainability of your database systems.
Scaling with PostgreSQL without boiling the ocean (shayon.dev)
“Postgres was great when we started but now that our service is being used heavily we are running into a lot of ‘weird’ issues”
Solving Postgres' Search Limitations (paradedb.com)
We recently completed one of our biggest engineering bets to date: migrating pg_search, a Postgres extension for full text search and analytics, to Postgres' block storage system. In doing so, pg_search is the first-ever extension1 to port an external file format to Postgres block storage.
A New Postgres Block Storage Layout for Full Text Search (paradedb.com)
A Rust procedural language handler for PostgreSQL (github.com/tcdi)
PL/Rust is a loadable procedural language that enables writing PostgreSQL functions in the Rust programming language.
Doubling Down on Postgres Compatibility: YugabyteDB Levels Up with PG15 Features (yugabyte.com)
The latest release of YugabyteDB delivers 15 key PostgreSQL 15 features, providing an industry-first zero-downtime PostgreSQL upgrade and downgrade experience, giving customers a truly cloud-native PostgreSQL solution.
Mathesar – an intutive spreadsheet-like interface to Postgres data (github.com/mathesar-foundation)
Mathesar is an open source application that makes working with PostgreSQL databases both simple and powerful.
Microsoft builds open source document database on PostgreSQL, suggests FerretDB (theregister.com)
Microsoft has launched a document database platform constructed on a relational PostgreSQL back end.
DocumentDB: Open-Source Announcement (microsoft.com)
We are excited to announce the official release of DocumentDB—an open-source document database platform and the engine powering the vCore-based Azure Cosmos DB for MongoDB, built on PostgreSQL.
DocumentDB: BSON data type CRUD in PostgreSQL (github.com/microsoft)
DocumentDB offers a native implementation of document-oriented NoSQL database, enabling seamless CRUD operations on BSON data types within a PostgreSQL framework.
Supercharge vector search with ColBERT rerank in PostgreSQL (vectorchord.ai)
Traditional vector search methods typically employ sentence embeddings to locate similar content. However, generating sentence embeddings through pooling token embeddings can potentially sacrifice fine-grained details present at the token level. ColBERT overcomes this by representing text as token-level multi-vectors rather than a single, aggregated vector. This approach, leveraging contextual late interaction at the token level, allows ColBERT to retain more nuanced information and improve search accuracy compared to methods relying solely on sentence embeddings.
Microsoft/documentdb: DocumentDB – NoSQL document database in Postgres (github.com/microsoft)
DocumentDB offers a native implementation of document-oriented NoSQL database, enabling seamless CRUD operations on BSON data types within a PostgreSQL framework.
DocumentDB: Open-Source Announcement (microsoft.com)
We are excited to announce the official release of DocumentDB—an open-source document database platform and the engine powering the vCore-based Azure Cosmos DB for MongoDB, built on PostgreSQL.
PostgreSQL is the Database Management System of the Year 2024 (db-engines.com)
DB-Engines is today announcing that PostgreSQL is our DBMS of the Year for the second year in a row, winning for the fifth time overall after also being top-ranked in 2017, 2018, 2019, and 2023. Second in the rankings was Snowflake, followed by Microsoft in third place. PostgreSQL has emerged as the most popular database management system over the past year, outpacing all other 423 monitored systems.
PostgreSQL Anonymizer (readthedocs.io)
PostgreSQL Anonymizer is an extension to mask or replace personally identifiable information (PII) or commercially sensitive data from a Postgres database.
Apache AGE: Graph Database for PostgreSQL (apache.org)
PostgreSQL Support for Certificate Transparency Logs Now Available (transparency.dev)
Certificate Transparency logs can now leverage PostgreSQL’s reliability and performance, giving log operators additional storage choice and flexibility.
Back to basics: Why we chose long-polling over websockets (inferable.ai)
Like many teams building real-time systems with Node.js and TypeScript, we've been exploring ways to handle real-time updates at scale. Our system handles hundreds of worker nodes constantly polling our PostgreSQL-backed control plane for new jobs (tool calls issued by agents), while agents themselves continuously pull for execution and chat state updates. What started as an exploration into WebSockets led us to a surprisingly effective "old-school" solution: HTTP long polling with Postgres.
Show HN: Postgres-SQLite: SQLite as first-class types in Postgres (github.com/michelp)
postgres-sqlite is a PostgreSQL extension that introduces SQLite databases as a first-class data type within PostgreSQL.
Using watermarks to coordinate change data capture in Postgres (sequinstream.com)
In change data capture, consistency is paramount. A single missing or duplicate message can cascade into time-consuming bugs and erode trust in your entire system. The moment you find a record missing in the destination, you have to wonder: is this the only one? How many others are there?