Hacker News with Generative AI: ClickHouse

Show HN: Houseplant – Database Migrations for ClickHouse (readthedocs.io)
Houseplant is a CLI tool that helps you manage database migrations for ClickHouse.
Show HN: PDF2MD – Rust+Redis+ClickHouse+VLLM conversion pipeline for PDFs (github.com/devflowinc)
Building Observability with ClickHouse (cmtops.dev)
This is a summary of my experience in searching for and implementing an optimal observability stack at my workplace.
Show HN: Open-source Kibana alternative for logs and traces in ClickHouse (github.com/hyperdxio)
HyperDX helps engineers quickly figure out why production is broken by making it easy to search & visualize logs and traces on top of any Clickhouse cluster (imagine Kibana, for Clickhouse).
JIT in ClickHouse (2022) (clickhouse.com)
In this post, I will describe what JIT compilation is, how LLVM infrastructure can be used for JIT compilation, and how JIT compilation works in ClickHouse.
From Zero to Terabytes: Building SaaS Analytics with ClickHouse (crisp.chat)
At Crisp, we help businesses manage all their customer conversations in one place—whether through chat, email, WhatsApp, or other channels - through a help desk platform. As our customers' needs grew, they asked for more detailed insights into their customer support, like response times and team performance.
Show HN: ClickHouse User Defined Table Function in Python (github.com/auxten)
I'm a SQL enthusiast and I want to query everything with SQL. This project is a collection of my experiments to query different data sources with SQL.
A new JSON data type for ClickHouse (clickhouse.com)
JSON has become the lingua franca for handling semi-structured and unstructured data in modern data systems. Whether it’s in logging and observability scenarios, real-time data streaming, mobile app storage, or machine learning pipelines, JSON’s flexible structure makes it the go-to format for capturing and transmitting data across distributed systems.
Show HN: Clickhttp: ClickHouse via HTTP Protocol (github.com/0xMihalich)
Working with Clickhouse Database via HTTP Protocol | Работа с БД Clickhouse по HTTP-протоколу
Unraveling the Mystery of Idle Threads in ClickHouse (altinity.com)
ClickHouse is renowned for its lightning-fast analytics and ability to handle large-scale queries. However, like any high-performance database, it can run into challenges under heavy workloads. Imagine running ClickHouse on a well-provisioned machine—ample cores, plenty of memory—only to notice that during peak loads, performance starts to degrade. You check the usual suspects: CPU, disk, network… and everything seems fine. Yet queries seem to “hang,” with even the simplest SELECT 1 taking as long as 10 seconds to execute. What’s going on?
Alert Evaluations: Incremental Merges in ClickHouse (highlight.io)
At Highlight, we rely on ClickHouse, an open-source columnar database built for handling large datasets and real-time analytics.
I spent 5 hours learning how ClickHouse built their internal data warehouse (vutr.substack.com)
My name is Vu Trinh, and I am a data engineer.
ClickHouse Data Modeling for Postgres Users (clickhouse.com)
Last month, we acquired PeerDB, a company that specializes in Postgres CDC. PeerDB makes it fast and simple to replicate data from Postgres to ClickHouse. A common question from PeerDB users is how to model their data in ClickHouse after the replication process to maximize the benefits of ClickHouse.
Show HN: Storing and Analyzing 160 billion Quotes in ClickHouse (rafalkwasny.com)
ClickHouse acquires PeerDB for native Postgres CDC integration (peerdb.io)
ClickHouse acquires PeerDB to expand its Postgres support (techcrunch.com)
Materialized views in ClickHouse: The data transformation Swiss Army knife (propeldata.com)
Optimizing ClickHouse: Tactics that worked for us (highlight.io)
Simple Postgres to ClickHouse replication featuring MinIO (peerdb.io)
ClickHouse JOINs 100x Faster (tinybird.co)
Using ClickHouse to scale an events engine (github.com/getlago)