
Apache Spark: Fundamentals

RDDs, DataFrames, Spark SQL, joins, window functions, and production batch pipelines.

Learn distributed computing from scratch with Apache Spark: RDDs, DataFrames, Spark SQL, joins, window functions, and how to deploy production batch pipelines.

Foundations · 20 chapters · 6h 40m · in Compute Engines

What to learn next

Next: Apache Spark: Advanced Internals

Read the first chapter free

Start reading now — no account required for the free chapters.

Start: Why Distributed Computing? →