Designing OLAP Systems
This course covers engine-level architecture decisions and assumes the modeling theory from S7. Topics include the core pre-aggregation-vs-raw tradeoff, choosing rollup granularity, materialized-view strategies and their consistency models, denormalization on columnar engines, late-arriving data and restatements, upsert architectures across Pinot/ClickHouse/Iceberg, designing for federation, designing for real-time with per-layer latency budgets, multi-engine architectures, and plotting cost-vs-latency curves. It ends with a capstone in which you design three valid OLAP architectures for one workload.
Course content
- 01 What "Designing an OLAP System" Means (Free)
- 02 Pre-Aggregation vs Raw - The Core Tradeoff
- 03 Choosing Aggregation Granularity
- 04 Materialized View Strategies Across Engines
- 05 Denormalization Decisions on Columnar Engines
- 06 Late-Arriving Data & Restatements
- 07 Upsert Architectures
- 08 Designing for Federation
- 09 Designing for Real-Time
- 10 Multi-Engine Architectures
- 11 Cost vs Latency Curves
- 12 Capstone - Design Three OLAP Architectures
Prerequisites
Read the first chapter free
Start reading now — no account required for the free chapters.