Capstone: End-to-End PII Pipeline

Ship the full pipeline: raw -> detect -> mask -> govern -> store in Iceberg, then handle a complete GDPR erasure cycle end to end.

Build the complete PII pipeline from scratch: raw ingestion through detection, masking, governance, Iceberg storage, and subject erasure, wiring together Spark, Presidio, and Iceberg into one system.

Specialization8 chapters· 3h 35m· in PII & Data Governance

Course content

  1. 01Lesson 1: Architecture OverviewFree
  2. 02Lesson 2: Ingestion Layer🔒
  3. 03Lesson 3: Detection Layer🔒
  4. 04Lesson 4: Masking Layer🔒
  5. 05Lesson 5: Governance Layer🔒
  6. 06Lesson 6: Storage Layer🔒
  7. 07Lesson 7: Erasure & Lifecycle🔒
  8. 08Lesson 8: Ship the Full Pipeline - Spark + Iceberg + Presidio🔒

Prerequisites

Read the first chapter free

Start reading now — no account required for the free chapters.