Hacker News with Generative AI: Database

The missing tier for query compilers (scattered-thoughts.net)
Database query engines used to be able to assume that disk latency was so high that the overhead of interpreting the query plan didn't matter. Unfortunately these days a cheap nvme ssd can supply data much faster than a query interpreter can process it.
Database Release and End-to-End Testing: ClickHouse Database Cloning (notion.site)
How bloom filters made SQLite 10x faster (avi.im)
This is the fascinating story of how researchers used Bloom filters cleverly to make SQLite 10x faster for analytical queries.
Reads Causing Writes in Postgres (jesipow.com)
It is good practice to regularly inspect the statements running in the hot path of your Postgres instance. One way to do this is to examine the pg_stat_statements view, which shows various statistics about the SQL statements executed by the Postgres server.
ScyllaDB – Why We're Moving to a Source Available License (scylladb.com)
ScyllaDB has decided to focus on a single release stream – ScyllaDB Enterprise. Starting with the ScyllaDB Enterprise 2025.1 release (ETA February 2025):
VectorChord: Store 400k Vectors for $1 in PostgreSQL (pgvecto.rs)
We’re pleased to announce our new vector search extension for PostgreSQL, providing a highly cost-effective way to manage large vectors. Using VectorChord, you can achieve a QPS of 131 with 0.95 precision on 100 million 768-dimensional vectors for the top 10 queries. This setup costs only $250 monthly and can be hosted on a single machine.
Supabase AI Assistant v2 (supabase.com)
Today we are releasing Supabase Assistant v2 in the Dashboard - a global assistant with several new abilities:
Introducing integrated inference: Embed, rerank, and retrieve data with one API (pinecone.io)
We’re excited to announce expanded inference capabilities alongside our core vector database to make it even easier and faster to build high-quality, knowledgeable AI applications with Pinecone.
Show HN: WeSQL – An Innovative MySQL That Stores All Data on S3 (github.com/wesql)
WeSQL is an innovative MySQL distribution that adopts a compute-storage separation architecture, with storage backed by S3 (and S3-compatible systems). It can run on any cloud, ensuring no vendor lock-in.
Sqlpkg – The SQLite Extension Hub (sqlpkg.org)
Find SQLite extensions using the search box above. You can download and install them manually, or use the sqlpkg package manager.
Pg_karnak: Transactional schema migration across tenant databases (thenile.dev)
When we need to describe Nile in a single sentence, we say "PostgreSQL re-engineered for multi-tenant apps".
FQL: A KV Query Language (github.com/janderland)
FQL provides a query language and an alternative client API for Foundation DB.
Steps in Writing Analytical SQL Queries (crunchydata.com)
It is never immediately obvious how to go from a simple SQL query to a complex one -- especially if it involves intricate calculations.
WASM SQLite made Notion 30% Faster (betterstack.com)
Notion made a tough bet on emerging tech, and it paid off in a big way
PgPDF: Pdf Type and Functions for Postgres (github.com/Florents-Tselai)
This extension for PostgreSQL provides a pdf data type.
Zero Downtime Django Deployments with Multistep Database Changes – Johnny Metz (johnnymetz.com)
Preventing downtime during deployments is crucial for maintaining service availability and ensuring a positive user experience. Blue-green deployments have emerged as a popular strategy to achieve this goal. However, they introduce challenges, especially when dealing with database changes. This article delves into what blue-green deployments are, why database changes can be tricky in this context, and how to navigate common change scenarios effectively in Django.
Pg_analytics has switched to PostgreSQL license (github.com/paradedb)
pg_analytics (formerly named pg_lakehouse) puts DuckDB inside Postgres. With pg_analytics installed, Postgres can query foreign object stores like AWS S3 and table formats like Iceberg or Delta Lake. Queries are pushed down to DuckDB, a high performance analytical query engine.
35% Faster Than the Filesystem (2017) (sqlite.org)
SQLite reads and writes small blobs (for example, thumbnail images) 35% faster¹ than the same blobs can be read from or written to individual files on disk using fread() or fwrite().
Graph Retrieval Using Postgres Recursive CTEs (sheshbabu.com)
Did you know you can use Postgres as a graph database for certain usecases?
Show HN: Greenmask 0.2 – Database anonymization tool (github.com/GreenmaskIO)
Greenmask is a powerful open-source utility that is designed for logical database backup dumping, anonymization, synthetic data generation and restoration.
Show HN: PostgREST API in Rust (Supports PG/MySQL/SQLite/ClickHouse) (github.com/subzerocloud)
subZero is a fast, Rust-powered library designed to simplify building REST APIs and backend services.
Insights after 11 years with Datomic [video] (youtube.com)
pgroll: PostgreSQL zero-downtime migrations made easy (github.com/xataio)
pgroll is an open source command-line tool that offers safe and reversible schema migrations for PostgreSQL by serving multiple schema versions simultaneously.
Building RAG with Postgres (anyblockers.com)
Postgres is a powerful tool for implementing Retrieval-Augmented Generation (RAG) systems. Its versatility and robustness make it an excellent choice for this task. By diving deep into a technology you’re already familiar with, you can experience a significant productivity boost. As the saying goes, “stick with the tools you know.” Using Postgres for RAG allows you to reason about the system more easily, cutting through the hype and focusing on building something great.
Hybrid Search with PostgreSQL and Pgvector (jkatz05.com)
A key metric when evaluating vector similarity search algorithms is “recall” - which measures the relevancy of the returned search results.
SQLite Transparent Compression (github.com/phiresky)
Extension for sqlite that provides transparent dictionary-based row-level compression for sqlite. This basically allows you to compress entries in a sqlite database almost as well as if you were compressing the whole DB file, but while retaining random access.
K1 Buys MariaDB (prnewswire.com)
K1 Investment Management, LLC ("K1"), one of the largest investors in small-cap enterprise software companies, today announced the completion of its tender offer to acquire 100 percent of the issued ordinary shares of MariaDB plc (NYSE: MRDB) ("MariaDB"), a leader in enterprise database solutions.
Beyond logical replication: pg_easy_replicate Supports Tracking DDL Changes (shayon.dev)
Launching Hasura DDN: the first metadata-driven data access layer (hasura.io)
BonsaiKV: Fast, Scalable Persistent KV Stores with Tiered, Heterogeneous Memory [pdf] (vldb.org)