Lesson 1
Introduction to Stream Processing
In this lesson students will learn what data streaming is. Students will learn the pros and cons of data streaming, and how it compares to traditional data strategies.
Course
Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.
Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.
4 weeks
Real-world Projects
Completion Certificate
Last Updated April 25, 2022
No experience required
Lesson 1
Introduction to Stream Processing
In this lesson students will learn what data streaming is. Students will learn the pros and cons of data streaming, and how it compares to traditional data strategies.
Lesson 2
Apache Kafka
In this lesson we’ll review the architecture and configuration of Apache Kafka.
Lesson 3
Data Schemas and Apache Avro
This lesson covers data schemas and data schema management, with a focus on Apache Avro.
Lesson 4
Kafka Connect and REST Proxy
This lesson covers producing and consuming data into Kafka with Kafka Connect and REST Proxy.
Lesson 5
Stream Processing Fundamentals
Learn to build real-time applications that instantly process events, the concepts of stream processing state storage, windowed processing, and stateful and non-stateful stream processing.
Lesson 6
Stream Processing with Faust
Students will learn how to use the Python stream processing library Faust to rapidly create powerful stream processing applications.
Lesson 7
KSQL
Learn how to write simple SQL queries to turn Kafka topics into KSQL streams and tables, and then write those tables back out to Kafka.
Lesson 8 • Project
Optimizing Public Transportation
For your first project, you’ll be streaming public transit status using Kafka and the Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.
Ben Goldberg
Staff Engineer at SpotHero
In his career as an engineer, Ben Goldberg has worked in fields ranging from computer vision to natural language processing. At SpotHero, he founded and built out their data engineering team, using Airflow as one of the key technologies.
Ben Goldberg
Staff Engineer at SpotHero
In his career as an engineer, Ben Goldberg has worked in fields ranging from computer vision to natural language processing. At SpotHero, he founded and built out their data engineering team, using Airflow as one of the key technologies.
Get Started Today