Designing OLAP Systems

Engine-level architecture decisions, assuming the modeling theory from S7. Covers the core pre-aggregation-vs-raw tradeoff, choosing rollup granularity, materialized-view strategies and their consistency models, denormalization on columnar engines, late-arriving data and restatements, upsert architectures across Pinot, ClickHouse, and Iceberg, designing for federation, designing for real-time with per-layer latency budgets, multi-engine architectures, and plotting cost-vs-latency curves. Ends with a capstone: designing three valid OLAP architectures for one workload.

Advanced · 12 chapters · 3h 25m · in Query Engines & OLAP

Course content

  1. What "Designing an OLAP System" Means (Free)
  2. Pre-Aggregation vs Raw - The Core Tradeoff
  3. Choosing Aggregation Granularity
  4. Materialized View Strategies Across Engines
  5. Denormalization Decisions on Columnar Engines
  6. Late-Arriving Data & Restatements
  7. Upsert Architectures
  8. Designing for Federation
  9. Designing for Real-Time
  10. Multi-Engine Architectures
  11. Cost vs Latency Curves
  12. Capstone - Design Three OLAP Architectures
