Courses Challenges

Parquet, Part 2: Indexing, Encryption, and Engines

Indexing, predicate pushdown, encryption, the Variant type, and engine integrations.

Part 2 of the Parquet deep-dive. Picks up where Part 1 ends: row-group and page statistics, the page index, bloom filters, end-to-end predicate pushdown, modular encryption (column- and footer-level), the Variant type and shredding, and how Parquet plugs into Spark, Iceberg, Delta Lake, DuckDB, and Arrow. Includes Docker labs.

Advanced13 chapters· 3h 22m· in Storage & File Formats

Explore this course on a real file in the Parquet Viewer. Drop any .parquet — or load the built-in sample — to see the schema, row groups, encodings, compression and statistics these lessons describe, 100% in your browser. Open the tool →

Course content

Prerequisites

↗Parquet, Part 1: Layout, Types, and Encodings

Read the first chapter free

Start reading now — no account required for the free chapters.

Start: Statistics: Min, Max, Null Count, and Distinct Count →More in Storage & File Formats