Logical Replication from Postgres to Iceberg
(crunchydata.com)
Operational and analytical workloads have historically been handled by separate database systems, though they are starting to converge. We built Crunchy Data Warehouse to put PostgreSQL at the frontier of analytics systems, using modern technologies like Iceberg and a hybrid query engine.
OLAP Hierarchical Aggregation with DuckDB SQL Recursive Common Table Expressions
(medium.com)
Aggregation for dimensional hierarchies doesn’t require costly Business Intelligence (BI) tools. You can use recursive SQL techniques to express your hierarchical data in relational form, allowing for easy and fast aggregation along multiple levels and dimensions.
Apache Iceberg: the Hadoop of the modern data stack?
(det.life)
In the early 2010s, Apache Hadoop dominated the big data conversation. Organizations raced to adopt it, seeing it as the cornerstone for scalable, distributed storage and processing. Today, Apache Iceberg is emerging as a cornerstone for data lakes and lakehouses in the modern data stack.
Use Cases for ChDB, a Powerful In-Memory OLAP SQL Engine
(runportcullis.co)
ClickHouse is quickly becoming a crowd-favorite real-time data warehouse platform for organizations looking to take advantage of blazing-fast query speeds in OLAP scenarios that power mission-critical applications and embedded analytics.
From Zero to Terabytes: Building SaaS Analytics with ClickHouse
(crisp.chat)
At Crisp, we provide a help desk platform that lets businesses manage all their customer conversations in one place, whether through chat, email, WhatsApp, or other channels. As our customers' needs grew, they asked for more detailed insights into their customer support, like response times and team performance.
DataChain: DBT for Unstructured Data
(github.com/iterative)
DataChain is a modern Pythonic data-frame library designed for artificial intelligence.
Dbt – Incremental but Incomplete
(tobikodata.com)
Earlier this month, dbt™ launched microbatch incremental models in version 1.9, a highly requested feature since the experimental insert_by_period was introduced back in 2018. While it's certainly a step in the right direction, it has been a long time coming.
I spent 5 hours learning how ClickHouse built their internal data warehouse
(vutr.substack.com)
My name is Vu Trinh, and I am a data engineer.
6 Powerful Databricks Alternatives for Data Lakes and Lakehouses
(definite.app)
Databricks has established itself as a leader in the data lake and lakehouse space, offering a powerful platform for big data processing and analytics.