Udacity Logo
Log InSign Up

Data Ingestion with Kafka & Kafka Streaming

Course

Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

4 weeks

Real-world Projects

Completion Certificate

Last Updated April 25, 2022

Prerequisites:

No experience required

Course Lessons

Lesson 1

Introduction to Stream Processing

In this lesson students will learn what data streaming is. Students will learn the pros and cons of data streaming, and how it compares to traditional data strategies.

Lesson 2

Apache Kafka

In this lesson we’ll review the architecture and configuration of Apache Kafka.

Lesson 3

Data Schemas and Apache Avro

This lesson covers data schemas and data schema management, with a focus on Apache Avro.

Lesson 4

Kafka Connect and REST Proxy

This lesson covers producing and consuming data into Kafka with Kafka Connect and REST Proxy.

Lesson 5

Stream Processing Fundamentals

Learn to build real-time applications that instantly process events, the concepts of stream processing state storage, windowed processing, and stateful and non-stateful stream processing.

Lesson 6

Stream Processing with Faust

Students will learn how to use the Python stream processing library Faust to rapidly create powerful stream processing applications.

Lesson 7

KSQL

Learn how to write simple SQL queries to turn Kafka topics into KSQL streams and tables, and then write those tables back out to Kafka.

Lesson 8 • Project

Optimizing Public Transportation

For your first project, you’ll be streaming public transit status using Kafka and the Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

Taught By The Best

Photo of Ben Goldberg

Ben Goldberg

Staff Engineer at SpotHero

In his career as an engineer, Ben Goldberg has worked in fields ranging from computer vision to natural language processing. At SpotHero, he founded and built out their data engineering team, using Airflow as one of the key technologies.

Taught By The Best

Photo of Ben Goldberg

Ben Goldberg

Staff Engineer at SpotHero

In his career as an engineer, Ben Goldberg has worked in fields ranging from computer vision to natural language processing. At SpotHero, he founded and built out their data engineering team, using Airflow as one of the key technologies.

Get Started Today

Data Ingestion with Kafka & Kafka Streaming