Backfill Strategies for Lakehouses
Partition-aware, idempotent backfills into Iceberg/Delta alongside live streams, with reconciliation.
Load history without breaking the present — idempotent writes into Iceberg/Delta, partition-aware backfills, running a backfill alongside a live stream, dual-pipeline reconciliation, watermark resets, lambda/kappa convergence, and verification.
Course content
- 01The Backfill Problem: History Without Breaking the PresentFree
- 02Idempotent Writes Into Iceberg/Delta🔒
- 03Partition-Aware Backfills🔒
- 04Running a Backfill Alongside a Live Stream🔒
- 05Dual-Pipeline Reconciliation🔒
- 06Watermark Resets & Reprocessing🔒
- 07Lambda vs Kappa, and Convergence🔒
- 08Backfilling Without Blowing the Budget🔒
- 09Verifying a Backfill: Counts, Checksums, Spot Checks🔒
- 10Capstone: Backfill Two Years of TheWorldShop Orders🔒
Prerequisites
What to learn next
Read the first chapter free
Start reading now — no account required for the free chapters.