Capstone: End-to-End PII Pipeline
Ship the full pipeline: raw -> detect -> mask -> govern -> store in Iceberg, then handle a complete GDPR erasure cycle end to end.
Build the PII pipeline from scratch, wiring Spark, Presidio, and Iceberg into one system that covers raw ingestion, PII detection, masking, governance, Iceberg storage, and data-subject erasure.
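To make the stage order concrete, here is a toy, dependency-free sketch of the same flow. It is not the course's implementation: a single email regex stands in for Presidio, a plain Python list stands in for the Iceberg table, and every name in it (`detect`, `mask`, `subject_key`, `run_pipeline`, `erase_subject`) is hypothetical and chosen only for illustration.

```python
import hashlib
import re

# Assumption: one email regex stands in for a real detector such as Presidio.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def detect(text):
    """Detect stage: return (start, end) spans of email addresses."""
    return [m.span() for m in EMAIL_RE.finditer(text)]


def mask(text, spans):
    """Mask stage: replace each span with a placeholder, working right to left
    so earlier offsets stay valid."""
    for start, end in sorted(spans, reverse=True):
        text = text[:start] + "<EMAIL>" + text[end:]
    return text


def subject_key(subject_id):
    """Pseudonymous lookup key so a subject's rows can be found at erasure time."""
    return hashlib.sha256(subject_id.encode("utf-8")).hexdigest()


def run_pipeline(raw_records, table):
    """raw -> detect -> mask -> govern -> store (the list is a stand-in table)."""
    for rec in raw_records:
        spans = detect(rec["text"])
        table.append({
            "subject_key": subject_key(rec["subject_id"]),
            "text": mask(rec["text"], spans),
            "pii_found": bool(spans),  # minimal governance tag
        })
    return table


def erase_subject(table, subject_id):
    """Erasure stage: drop every row for one subject and return the count removed."""
    key = subject_key(subject_id)
    kept = [row for row in table if row["subject_key"] != key]
    removed = len(table) - len(kept)
    table[:] = kept
    return removed
```

With two records, one for subject `u1` containing an email address and one for `u2` without, `run_pipeline` stores two masked rows and `erase_subject(table, "u1")` removes exactly one. In a real Iceberg table, erasure also has to deal with older snapshots, because a row-level delete alone leaves the removed rows reachable through table history until those snapshots are expired.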