Udacity Logo
Log InSign Up

Big Data Systems

Course

There are 2 main roles in the Big Data industry: Big Data Engineer and Big Data Architect. We will focus on the Architect role. We will look at the characteristics of Big Data, its business value, and how companies are using Big Data today. We study the most popular storage and processing frameworks.....aka...Big Data ecosystem components. We will dive deep into NoSQL, how it differs from traditional relational databases, how to model, and what the tool interface looks like. Finally, we will talk about the benefits, challenges, design patterns of Data Lake technology. Your final project is based on a real-world scenario that will require you to think like an architect and build an end-to-end data lake system proposal.

There are 2 main roles in the Big Data industry: Big Data Engineer and Big Data Architect. We will focus on the Architect role. We will look at the characteristics of Big Data, its business value, and how companies are using Big Data today. We study the most popular storage and processing frameworks.....aka...Big Data ecosystem components. We will dive deep into NoSQL, how it differs from traditional relational databases, how to model, and what the tool interface looks like. Finally, we will talk about the benefits, challenges, design patterns of Data Lake technology. Your final project is based on a real-world scenario that will require you to think like an architect and build an end-to-end data lake system proposal.

4 weeks

Real-world Projects

Completion Certificate

Last Updated May 8, 2023

Prerequisites:

No experience required

Course Lessons

Lesson 1

Introduction to Big Data Systems

In this lesson, we will take a 30000 foot view of Big Data and see why it is so important. We will meet the instructor and hear about the components of the course, including the final project.

Lesson 2

Characteristics of Big Data

In this lesson you will learn about the main characteristics of Big Data, called the 4Vs. You will also start to explore the Big Data ecosystem.

Lesson 3

Ingestion, Storage and Processing Frameworks

In this lesson, you'll take a look at several of the layers that make Big Data possible, We will also look at some of the tools that help implement those layers.

Lesson 4

NoSQL Databases

In this lesson, we will look at the differences between NoSQL and SQL. We will also see why and how NoSQL databases provide capabilities that allow Big Data to be possible.

Lesson 5

Scalable Data Lake Architecture

In this lesson, we will see what a Data Lake storage implementation of Big Data looks like. In addition to the benefits, we will see what considerations, risks, and challenges organizations face.

Lesson 6 • Project

Project - Designing an Enterprise Data Lake System

In this lesson, we will lead you through the scenario and instructions for completing the final project, which is a proposal for an actual Data Lake architecture.

Taught By The Best

Photo of Shrinath Parikh

Shrinath Parikh

Senior Data Architect

Shrinath is an entrepreneur and Data Architect passionate about helping enterprise companies transform and engineer their big data analytics applications on Cloud. He has worked with AWS, Google and Microsoft cloud platforms, has over 15 certifications and an MS in Computer Science from The University Of Texas at Dallas.

Taught By The Best

Photo of Shrinath Parikh

Shrinath Parikh

Senior Data Architect

Shrinath is an entrepreneur and Data Architect passionate about helping enterprise companies transform and engineer their big data analytics applications on Cloud. He has worked with AWS, Google and Microsoft cloud platforms, has over 15 certifications and an MS in Computer Science from The University Of Texas at Dallas.

Get Started Today

Big Data Systems