Udacity Logo
Log InJoin for Free

Data Ingestion with Kafka and Kafka Streaming

Course

Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

Learn to use REST Proxy, Kafka Connect, KSQL, and Faust Python Stream Processing and use it to stream public transit statuses using Kafka and Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

Advanced

4 weeks

Real-world Projects

Completion Certificate

Last Updated December 29, 2023

Skills you'll learn:
Faust • Confluent Kafka Python client • Kafka rest proxy • KSQL
Prerequisites:
Basic descriptive statistics

Course Lessons

Lesson 1

Introduction to Stream Processing

In this lesson students will learn what data streaming is. Students will learn the pros and cons of data streaming, and how it compares to traditional data strategies.

Lesson 2

Apache Kafka

In this lesson we’ll review the architecture and configuration of Apache Kafka.

Lesson 3

Data Schemas and Apache Avro

This lesson covers data schemas and data schema management, with a focus on Apache Avro.

Lesson 4

Kafka Connect and REST Proxy

This lesson covers producing and consuming data into Kafka with Kafka Connect and REST Proxy.

Lesson 5

Stream Processing Fundamentals

Learn to build real-time applications that instantly process events, the concepts of stream processing state storage, windowed processing, and stateful and non-stateful stream processing.

Lesson 6

Stream Processing with Faust

Students will learn how to use the Python stream processing library Faust to rapidly create powerful stream processing applications.

Lesson 7

KSQL

Learn how to write simple SQL queries to turn Kafka topics into KSQL streams and tables, and then write those tables back out to Kafka.

Lesson 8 • Project

Optimizing Public Transportation

For your first project, you’ll be streaming public transit status using Kafka and the Kafka ecosystem to build a stream processing application that shows the status of trains in real-time.

Taught By The Best

Photo of Ben Goldberg

Ben Goldberg

Staff Engineer at SpotHero

In his career as an engineer, Ben Goldberg has worked in fields ranging from computer vision to natural language processing. At SpotHero, he founded and built out their data engineering team, using Airflow as one of the key technologies.

The Udacity Difference

Combine technology training for employees with industry experts, mentors, and projects, for critical thinking that pushes innovation. Our proven upskilling system goes after success—relentlessly.

Demonstrate proficiency with practical projects

Projects are based on real-world scenarios and challenges, allowing you to apply the skills you learn to practical situations, while giving you real hands-on experience.

  • Gain proven experience

  • Retain knowledge longer

  • Apply new skills immediately

Top-tier services to ensure learner success

Reviewers provide timely and constructive feedback on your project submissions, highlighting areas of improvement and offering practical tips to enhance your work.

  • Get help from subject matter experts

  • Learn industry best practices

  • Gain valuable insights and improve your skills