Strata 2

Ingestion & Transport

Consumer lag spikes, a rebalance storm stalls the whole group, and you are staring at duplicate records downstream wondering whether “exactly-once” meant what you thought it meant.

This stratum builds Kafka from the fundamentals up to the wire: partitions and consumer groups, the in-sync-replica replication protocol, leader election, log compaction, and the real semantics behind at-least-once versus exactly-once — so you can trace a data-quality failure back to the broker instead of guessing.

What you'll learn

  • Explain how producers, partitions, and consumer groups distribute and order data
  • Reason about the ISR protocol, acks, and what exactly-once guarantees (and doesn't)
  • Diagnose consumer rebalancing storms and lag from first principles
  • Read Kafka's on-the-wire protocol and log-compaction behavior

Tracks & courses

Full navigation is in the sidebar. Here's what each track gives you and the courses inside it.

Track B — Change Data Capture (CDC)

Capture every insert, update, and delete from operational databases and stream it reliably — from Debezium fundamentals to outbox patterns, exactly-once sinks, and GDPR erasure at scale.

Track D — Batch Ingestion & Backfills

The often-skipped half of ingestion — connector-based batch extraction with incremental cursors and idempotent loads, plus backfill strategies that load history into a lakehouse without breaking live streams.

Track E — Architecting the Ingestion Layer

The architect capstone — design a production ingestion platform end to end: event-driven vs batch decisioning, schema registry and data contracts, topic taxonomy, multi-tenant ingest, DLQ/replay/retention policy, cost modeling, and cross-region resilience.

Related topics

Start Ingestion & Transport free

The first chapters of every course are free to read — no account needed.