Petascale Labs
The PlatformSimulation Arcade
RoadmapCoursesChallengesTopicsToolsFree
PricingBlog
  1. Home/
  2. Topics/
  3. Data governance
Topic

Data governance

Data governance shows up across 3 courses in 1 layerof the data platform stack. Here's where it's taught, a free way to practice it, and what to learn next.

Where it's taught

🔐PII & Data Governance

PII Fundamentals

Identify, classify, and manage PII across your data stack — from direct identifiers to quasi-identifiers hiding in plain sight.

7 ch

1 free

Iceberg PII Lifecycle

Retention policies, partition and snapshot expiration, orphan-file cleanup, and an automated, verifiable PII purge pipeline in Iceberg.

7 ch · 2h 50m

1 free

Capstone: End-to-End PII Pipeline

Ship the full pipeline: raw -> detect -> mask -> govern -> store in Iceberg, then handle a complete GDPR erasure cycle end to end.

8 ch · 3h 35m

1 free

Related topics

↗apache iceberg↗apache spark↗data classification↗data inventory↗data masking↗data purge↗data retention↗PHI↗PII↗PII detection

Start learning data governance free

The first chapter of every course is free to read — no account needed.

Start: PII Fundamentals →All strata
Petascale Labs

The physics layer of data

From byte-level storage to business-grade metrics. Built with depth, not breadth.

Curriculum

Data Engineer RoadmapAll strataStorage & File FormatsIngestion & TransportOpen Table FormatsCompute EnginesOrchestration & PipelinesQuery Engines & OLAPSemantic & Metrics LayerPII & Data Governance

Tools

All toolsParquet ViewerFreeSCD PlaygroundFreePII Masking GeneratorFree

Company

AboutBlogContact

Legal

Privacy PolicyTerms of ServiceCookie Policy

Email

hello@petascalelabs.com

Support

support@petascalelabs.com

Company

Petascale Labs, Inc.

© 2026 Petascale Labs, Inc. All rights reserved.

PrivacyTermsCookiesContact