Hacker News with Generative AI: Databases

When parameterization fails: SQL injection in Nim using parameterized queries (nns.ee)
I discovered a potential SQL injection vulnerability in Nim's standard library module db_postgres.
Show HN: SQLite? Postgres? Use both with omni_SQLite (omnigres.org)
omni_sqlite is an extension that adds the capability to use SQLite databases as a first-class data type within Postgres. It uses expandable datum technique to ensure no additional serialization overhead for data in-flight.
Stop syncing everything (sqlsync.dev)
Partial replication sounds easy—just sync the data your app needs, right? But choosing an approach is tricky: logical replication precisely tracks every change, complicating strong consistency, while physical replication avoids that complexity but requires syncing every change, even discarded ones. What if your app could combine the simplicity of physical replication with the efficiency of logical replication? That’s the key idea behind Graft, the open-source transactional storage engine I’m launching today.
Turso SQLite Offline Sync Public Beta (turso.tech)
We're excited to announce that Turso Offline Sync is now available in public beta!
Dragonfly: Modern Replacement for Redis and Memcached (github.com/dragonflydb)
Dragonfly is an in-memory data store built for modern application workloads.
Show HN: I built an open-source NotebookLM alternative using Morphik (github.com/morphik-org)
Morphik is an open-source database designed for AI applications that simplifies working with unstructured data. It provides advanced RAG (Retrieval Augmented Generation) capabilities with multi-modal support, knowledge graphs, and intuitive APIs.
Ask HN: Code should be stored in a database. Who has tried this? (ycombinator.com)
To me it seems obvious that code should be stored in a database rather than a hierarchical, text-based format.
Show HN: Cloud-Ready Postgres MCP Server (github.com/stuzero)
A Model Context Protocol (MCP) server for PostgreSQL databases with enhanced capabilities for AI agents.
Memelang: A hybrid graph-relational query language (memelang.net)
Memelang is a hybrid graph-relational query language that encodes complex relationships in a compact syntax.
Postgres Language Server: Initial Release (github.com/supabase-community)
Postgres Extension Tutorial (github.com/IshaanAdarsh)
Welcome to the Postgres Extension Tutorial. This comprehensive tutorial is designed to provide a step-by-step guide to PostgreSQL extension development.
Show HN: Searchable encryption, SQL proxy for Postgres (cipherstash.com)
Following fast on the heels of last week’s Protect.js announcement, we’re excited to introduce version 2 of CipherStash Proxy, our no-code data-protection solution for PostgreSQL.
PostgreSQL Lands Initial Support for IO_uring: "Can Be Considerably Faster" (phoronix.com)
As a very exciting improvement for the open-source PostgreSQL database server, it has merged initial support for making use of IO_uring on Linux servers for asynchronous I/O and can provide for some nice performance improvements.
New open-source benchmark for real-time analytics applications (github.com/timescale)
Faster interpreters in Go: Catching up with C++ (planetscale.com)
The SQL evaluation engine that ships with Vitess, the open-source database that powers PlanetScale, was originally implemented as an AST evaluator that used to operate directly on the SQL AST generated by our parser. Over this past year, we've gradually replaced it with a Virtual Machine which, despite being written natively in Go, performs similarly to the original C++ evaluation code in MySQL.
Reducing Cloud Spend: Migrating Logs from CloudWatch to Iceberg with Postgres (crunchydata.com)
As a database service provider, we store a number of logs internally to audit and oversee what is happening within our systems.
Sharding Pgvector (pgdog.dev)
If you find yourself working with embeddings, you’ve shopped around for a vector database. pgvector is a great option if you’re using Postgres already. Once you reach a certain scale (about a million arrays), building indices starts taking a long time. Some workarounds, like parallel workers, help, but you still need to fit the whole graph in memory.
DuckDB: Development Roadmap (duckdb.org)
The DuckDB project is governed by the non-profit DuckDB Foundation.
It's Time to Stop Building KV Databases (buttondown.com)
I'm so sick of Key-Value. The most unopinionated, uninspired data model that is a pain to use for anything.
Why PostgreSQL needs a better API for alternative table engines? (orioledb.com)
For a long time now, PostgreSQL has had an extensible Index Access Method API (called AM), which has stood the test of time and enabled numerous robust extensions to provide their own index types. For example: rum, pgvector, bloom, zombodb and others. PostgreSQL 12 introduced the Table AM API, promising equivalent flexibility for table access methods.
Open-source Rust database tops JSONBench using DataFusion (greptime.com)
GreptimeDB Takes on the Billion-JSON-Document Challenge - Outperforms ClickHouse, VictoriaLogs, and Competitors
Let's Build a Simple Database (cstack.github.io)
I’m building a clone of sqlite from scratch in C in order to understand, and I’m going to document my process as I go.
MySQL transactions per second vs. fsyncs per second (2020) (sirupsen.com)
How many transactions (‘writes’) per second is MySQL capable of?
TidesDB: A fast key/value store for modern hardware (tidesdb.com)
TidesDB is an embeddable persistant key-value storage engine.
Show HN: Minimalytics – a standalone minimal analytics app built on SQLite (github.com/nafey)
Minimalytics is a standalone minimalist analytics tool built on SQLite.
Show HN: GizmoSQL – Run DuckDB as a Server with Arrow Flight SQL (github.com/gizmodata)
This is the GizmoSQL public repo - used to provide info and artifact downloads for GizmoSQL - an Apache Arrow Flight SQL server implementation using DuckDB or SQLite as a backend database.
One Billion Row Challenge in Racket (defn.io)
I decided to have some fun tonight and work on a Racket solution to the One Billion Row Challenge.
Fauna Service Winding Down (fauna.com)
The Fauna service will be ending on May 30, 2025. For more information, read the announcement and the FAQ.
CockroachDB has changed their license, again (github.com/cockroachdb)
The real failure rate of EBS (planetscale.com)
PlanetScale has deployed millions of Amazon Elastic Block Store (EBS) volumes across the world. We create and destroy tens of thousands of them every day as we stand up databases for customers, take backups, and test our systems end-to-end. Through this experience, we have an unique viewpoint into the failure rate and mechanisms of EBS, and have spent a lot of time working on how to mitigate them.