🏠 ClickHouse Internals

ClickHouse is a column-oriented OLAP database designed for blazing-fast analytical queries. These visualizations help you understand how MergeTree stores data, how merges work, and how to tune for your workload.

✦ Live

MergeTree Engine

Parts, merges, sparse indexes, and column-oriented storage

✦ Live

Distributed Tables

Sharding, replication, and cross-shard queries

✦ Live

Materialized Views

Real-time aggregation pipelines and projection chains

✦ Live

Query Execution Pipeline

Parsing, processor DAG, parallel reads, two-level aggregation, and backpressure

✦ Live

Data Compression

Codec stacking, per-column compression, and storage efficiency

✦ Live

Primary Keys & Indexing

Sparse indexes, granules, ORDER BY vs PRIMARY KEY, compound keys, and data skipping indexes

✦ Live

Partitioning

How PARTITION BY splits data into parts, partition pruning, granularity trade-offs, and lifecycle with TTL

✦ Live

Joins & Dictionaries

Hash joins, external dictionaries, and lookup strategies

✦ Live

Dictionaries Deep Dive

In-memory key-value lookups β€” replace JOINs with O(1) dictGet() calls from external sources

✦ Live

Data Ingestion

Batch inserts, async writes, Kafka engine & streaming patterns

✦ Live

ReplicatedMergeTree

ZooKeeper coordination, insert deduplication, and quorum writes

✦ Live

Mutations & ALTER

How UPDATE & DELETE actually work β€” part rewrites, mutation queues, and lightweight deletes

✦ Live

ClickHouse vs BigQuery

Open-source columnar powerhouse vs Google's serverless warehouse β€” latency, cost, Dremel internals, and decision guide

✦ Live

ClickHouse vs Snowflake

Open-source speed demon vs cloud warehouse giant β€” architecture, performance, cost, and when to use each

✦ Live

ClickHouse vs DuckDB

Architecture comparison, benchmarks, and when to use each columnar engine

✦ Live

ClickHouse vs TimescaleDB

Columnar OLAP vs PostgreSQL time-series β€” storage, compression, query benchmarks, and decision guide

✦ Live

ClickHouse vs Elasticsearch

Log analytics showdown β€” storage, query performance, cost, and migration guide from ELK

✦ Live

ClickHouse vs Apache Druid

Real-time OLAP showdown β€” storage formats, ingestion pipelines, query models, and when to choose each

✦ Live

ReplacingMergeTree

Deduplication, version columns, FINAL keyword, and mutable data patterns in ClickHouse

Coming soon

Performance Tuning

Profiling queries, system tables, and optimization patterns