Skills you'll learn:
Data lakes and Lakehouses with Spark and Azure Databricks
Course
Learn about the big data ecosystem and how to use Spark to work with massive datasets. Learners will also store big data in a data lake and develop Lakehouse architecture on the Azure Databricks platform.
Learn about the big data ecosystem and how to use Spark to work with massive datasets. Learners will also store big data in a data lake and develop Lakehouse architecture on the Azure Databricks platform.
Intermediate
3 weeks
Last Updated May 21, 2024
Prerequisites:
Intermediate
3 weeks
Last Updated May 21, 2024
Skills you'll learn:
Prerequisites:
No experience required
Course Lessons
Lesson 1
Course Introduction
In this lesson, you'll learn about the course, including the prerequisites, tools, environment, and course project.
Lesson 2
Big Data Ecosystem, Data Lakes, and Spark
In this lesson, you will learn about the problems that Apache Spark is designed to solve. You'll also learn about the greater Big Data ecosystem and how Spark fits into it.
Lesson 3
Data Wrangling with Spark
In this lesson, we'll dive into how to use Spark for cleaning and aggregating data.
Lesson 4
Spark Debugging and Optimization
In this lesson, you will learn best practices for debugging and optimizing your Spark applications.
Lesson 5
Azure Databricks
In this lesson, you'll create Spark Clusters and Spark code on the Azure Databricks platform.
Lesson 6
Data Lakes and Lakehouse with Azure Databricks
In this lesson, you'll create data lakes and Lakehouse architecture on the Azure Databricks platform
Lesson 7 • Project
Building an Azure Data Lake for Bike Share Data Analytics
In this project, you'll implement Lakehouse architecture on the Azure Databricks platform.
Taught By The Best
Matt Swaffer
General Manager, MBS
Matt has been working in software development and data science for over 20 years. Matt's career is centered on the intersection of technology, data, and human psychology. He is passionate about using data science to have a meaningful impact on our people and our planet.
The Udacity Difference
Combine technology training for employees with industry experts, mentors, and projects, for critical thinking that pushes innovation. Our proven upskilling system goes after success—relentlessly.
Demonstrate proficiency with practical projects
Projects are based on real-world scenarios and challenges, allowing you to apply the skills you learn to practical situations, while giving you real hands-on experience.
Gain proven experience
Retain knowledge longer
Apply new skills immediately
Top-tier services to ensure learner success
Reviewers provide timely and constructive feedback on your project submissions, highlighting areas of improvement and offering practical tips to enhance your work.
Get help from subject matter experts
Learn industry best practices
Gain valuable insights and improve your skills
Enroll in Data lakes and Lakehouses with Spark and Azure Databricks. Choose the plan that works for you
All Access monthly
Unlimited access to our top-rated courses
Personalized Career Services
Cancel Anytime
Real-world projects
Personalized project reviews
Program certificates
Best Value
All Access bundle1
All the same great benefits as our monthly plan
The most cost-effective way to develop the skills you want
- 1Discount applies to the first 4 months of membership, after which plans are converted to month-to-month.
Your subscription also includes:
Your subscription also includes:
2 weeks
Intermediate
(4)
2 months
Advanced
2 weeks
3 weeks
Intermediate
2 weeks
Advanced
4 weeks
Advanced
3 weeks
Beginner
(147)
2 months
Advanced
4 weeks
Advanced
2 weeks
Advanced
4 weeks
Intermediate
4 weeks
Beginner
4 weeks
Intermediate
2 weeks
Intermediate
(91)
3 months
Advanced
1 month
Beginner