Strata 5

Orchestration & Pipelines

A DAG half-succeeded at 3am: some tasks ran twice, a backfill double-counted revenue, and now you are reconciling numbers instead of sleeping.

This stratum names the failure modes before the tools: the Airflow scheduler loop and why a task ran (or didn't, or twice), idempotency and state, backfills that don't double-count, freshness SLAs that page for the right reasons, and the gap between a green pipeline and correct data.

What you'll learn

  • Explain the Airflow scheduler loop and why a task ran, skipped, or duplicated
  • Design idempotent tasks and backfills that survive partial failure
  • Distinguish pipeline health from data quality — and alert on the right one
  • Run Airflow in production: deploys, secrets, logging, multi-tenancy, and cost

Tracks & courses

Full navigation is in the sidebar. Here's what each track gives you and the courses inside it.

Related topics

Start Orchestration & Pipelines free

The first chapters of every course are free to read — no account needed.