Lesson 1
Welcome
Introduction to Data Streaming with Spark
Course
In this course you will grow your expertise in the components of streaming data systems, and build a real time analytics application. Specifically, you will be able to identify components of Spark Streaming (architecture and API), build a continuous application with Structured Streaming, consume and process data from Apache Kafka with Spark Structured Streaming (including setting up and running a Spark Cluster), create a DataFrame as an aggregation of source DataFrames, sink a composite DataFrame to Kafka, and visually inspect a data sink for accuracy.
In this course you will grow your expertise in the components of streaming data systems, and build a real time analytics application. Specifically, you will be able to identify components of Spark Streaming (architecture and API), build a continuous application with Structured Streaming, consume and process data from Apache Kafka with Spark Structured Streaming (including setting up and running a Spark Cluster), create a DataFrame as an aggregation of source DataFrames, sink a composite DataFrame to Kafka, and visually inspect a data sink for accuracy.
Advanced
4 weeks
Real-world Projects
Completion Certificate
Last Updated August 29, 2023
Skills you'll learn:
Prerequisites:
Lesson 1
Introduction to Data Streaming with Spark
Lesson 2
In this lesson, you'll learn about working with Spark Dataframes and views.
Lesson 3
In this lesson, you'll learn how to work with JSON and complete Joins for data streaming.
Lesson 4
This lesson will focus on working with Redis, Base64, and JSON in Data Streaming.
Lesson 5 • Project
As your final project for this course, you will demonstrate the skills you have learned by evaluating human balance with spark streaming.
Professor at Brigham Young University Idaho
Sean currently teaches cybersecurity and DevOps courses at Brigham Young University Idaho. He has been a software engineer for over 16 years. Some of the most exciting projects he has worked on involved data pipelines for DNA processing and vehicle telematics.
Combine technology training for employees with industry experts, mentors, and projects, for critical thinking that pushes innovation. Our proven upskilling system goes after success—relentlessly.
Demonstrate proficiency with practical projects
Projects are based on real-world scenarios and challenges, allowing you to apply the skills you learn to practical situations, while giving you real hands-on experience.
Gain proven experience
Retain knowledge longer
Apply new skills immediately
Top-tier services to ensure learner success
Reviewers provide timely and constructive feedback on your project submissions, highlighting areas of improvement and offering practical tips to enhance your work.
Get help from subject matter experts
Learn industry best practices
Gain valuable insights and improve your skills
Full Catalog Access
One subscription opens up this course and our entire catalog of projects and skills.
Average time to complete a Nanodegree program
4 weeks
, Advanced
(127)
2 months
, Advanced
4 weeks
, Intermediate
4 weeks
, Intermediate
8 hours
4 weeks
, Intermediate
4 weeks
, Advanced
3 weeks
, Advanced
4 weeks
, Beginner
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Beginner
(2)
4 months
, Advanced
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Intermediate
Streaming API Development and Documentation
4 weeks
, Advanced
(127)
2 months
, Advanced
4 weeks
, Intermediate
4 weeks
, Intermediate
8 hours
4 weeks
, Intermediate
4 weeks
, Advanced
3 weeks
, Advanced
4 weeks
, Beginner
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Beginner
(2)
4 months
, Advanced
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Intermediate