Petascale Labs
The PlatformSimulation ArcadeLibraryToolsPricing
Curriculum
  • Storage & File Formats
  • Ingestion & Transport
    Apache Kafka: Fundamentals
    • 01Why Streaming? The Problem Kafka SolvesFree
    • 02Kafka's Mental Model: Topics, Partitions, Offsets🔒
    • 03Brokers, Clusters & Replication🔒
    • 04Your First Producer🔒
    • 05Your First Consumer & Consumer Groups🔒
    • 06Keys, Partitioning & Ordering Guarantees🔒
    • 07Delivery Semantics: At-Most-Once, At-Least-Once, Exactly-Once🔒
    • 08Serialization: JSON, Avro & the Schema Registry🔒
    • 09Retention, Compaction & Cleanup Policies🔒
    • 10Kafka Connect: Moving Data In and Out Without Code🔒
    • 11Operating Kafka Locally with Docker🔒
    • 12Reading the Spark Kafka Source🔒
    • 13Common Pitfalls: Hot Partitions, Rebalance Storms, Lag🔒
    • 14Capstone: Build a Clickstream Pipeline for TheWorldShop🔒
    Apache Kafka: Internals & Protocol Deep Dive
    Apache Kafka: Operations, Performance & Reliability
  • Open Table Formats
  • Compute Engines
  • Orchestration & Pipelines
  • PII & Data Governance
  • Query Engines & OLAP
  • Semantic & Metrics Layer
CoursesChallenges
  1. Home/
  2. Curriculum/
  3. Ingestion & Transport/
  4. Apache Kafka: Fundamentals

Apache Kafka: Fundamentals

Topics, partitions, producers, consumers, delivery semantics, schemas, and Kafka Connect.

Learn how data moves at scale — topics, partitions, producers, consumers, delivery semantics, schemas, and Kafka Connect — by building real streaming pipelines for an e-commerce platform.

Foundations14 chapters· 4h 40m· in Ingestion & Transport

Course content

  1. 01Why Streaming? The Problem Kafka SolvesFree
  2. 02Kafka's Mental Model: Topics, Partitions, Offsets🔒
  3. 03Brokers, Clusters & Replication🔒
  4. 04Your First Producer🔒
  5. 05Your First Consumer & Consumer Groups🔒
  6. 06Keys, Partitioning & Ordering Guarantees🔒
  7. 07Delivery Semantics: At-Most-Once, At-Least-Once, Exactly-Once🔒
  8. 08Serialization: JSON, Avro & the Schema Registry🔒
  9. 09Retention, Compaction & Cleanup Policies🔒
  10. 10Kafka Connect: Moving Data In and Out Without Code🔒
  11. 11Operating Kafka Locally with Docker🔒
  12. 12Reading the Spark Kafka Source🔒
  13. 13Common Pitfalls: Hot Partitions, Rebalance Storms, Lag🔒
  14. 14Capstone: Build a Clickstream Pipeline for TheWorldShop🔒

What to learn next

↗Apache Kafka: Internals & Protocol Deep Dive· next

Read the first chapter free

Start reading now — no account required for the free chapters.

Start: Why Streaming? The Problem Kafka Solves →More in Ingestion & Transport
Petascale Labs

The physics layer of data

From byte-level storage to business-grade metrics. Built with depth, not breadth.

Curriculum

All strataStorage & File FormatsIngestion & TransportOpen Table FormatsCompute EnginesOrchestration & PipelinesQuery Engines & OLAPSemantic & Metrics LayerPII & Data Governance

Tools

All toolsParquet ViewerFreeSCD PlaygroundFree

Company

AboutContact

Legal

Privacy PolicyTerms of ServiceCookie Policy

Email

hello@petascalelabs.com

Support

support@petascalelabs.com

Company

Petascale Labs, Inc.

© 2026 Petascale Labs, Inc. All rights reserved.

PrivacyTermsCookiesContact