Site Reliability Engineer

Nanodegree Program

The Site Reliability Engineer (SRE) Nanodegree program equips software developers with the engineering and operational skills needed to build automation tools and responses, so that the solutions they design meet non-functional requirements such as availability, performance, security, and maintainability. The content focuses both on designing systems that automate the response to issues with software sites and on responding to common on-call situations.

Intermediate

4 months

Real-world Projects

Completion Certificate

Last Updated September 13, 2023

Skills you'll learn:
Toil reduction • Data recovery • Site reliability engineering business context • Deployment automation
Prerequisites:

No experience required

Courses In This Program

Course 1 • 45 minutes

Welcome!

Welcome! We're so glad you're here. Join us in learning a bit more about what to expect in this program and ways to succeed.

Course 2 • 4 weeks

Establishing a Foundation in Observability

In this course, we will learn the foundational concepts of observability in terms of people and tools.

Course 3 • 4 weeks

Planning for High Availability and Incident Response

In this course, we will look at how SREs view availability and reliability for their infrastructure. We'll learn how to create effective monitoring using SLOs and SLIs, and we will create dashboards in Grafana. Next, we'll identify all our IT assets and ensure they are configured for high availability. Then we will craft a disaster recovery plan to make sure failover is seamless and automated. After that, we'll deploy the infrastructure to AWS using Terraform, learn the benefits of infrastructure as code, and see how easy it is to deploy to multiple regions. Finally, we'll learn how to make databases highly available and ready for disaster recovery. We'll look at recovery strategies and implement them in AWS via Terraform.
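To give a feel for the SLO and SLI work this course describes, here is a minimal Python sketch of the underlying arithmetic. It is not taken from the course materials: the `Slo` class and the function names are hypothetical, and it assumes a simple request-based availability SLI (successful requests divided by total requests) measured over a rolling window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slo:
    """Hypothetical availability objective over a rolling window."""
    target: float          # e.g. 0.999 means 99.9% of requests should succeed
    window_days: int = 30  # assumed rolling window length


def availability_sli(good_requests: int, total_requests: int) -> float:
    """SLI: fraction of requests that succeeded (1.0 when there was no traffic)."""
    if total_requests == 0:
        return 1.0
    return good_requests / total_requests


def error_budget_remaining(slo: Slo, good_requests: int, total_requests: int) -> float:
    """Fraction of the error budget still unspent; a negative value means the SLO is breached."""
    failures = total_requests - good_requests
    allowed_failures = (1.0 - slo.target) * total_requests
    if allowed_failures == 0:
        # No traffic, or a 100% target: any failure exhausts the budget.
        return 1.0 if failures == 0 else float("-inf")
    return 1.0 - failures / allowed_failures


def allowed_downtime_minutes(slo: Slo) -> float:
    """Total downtime the SLO tolerates over its window, in minutes."""
    return (1.0 - slo.target) * slo.window_days * 24 * 60


if __name__ == "__main__":
    slo = Slo(target=0.999)
    print(f"SLI: {availability_sli(999_500, 1_000_000):.4%}")
    print(f"Budget left: {error_budget_remaining(slo, 999_500, 1_000_000):.1%}")
    print(f"Allowed downtime: {allowed_downtime_minutes(slo):.1f} minutes per window")
```

In practice, values like these would typically be computed by the monitoring stack and shown on a dashboard rather than in application code; the sketch only illustrates how an SLO target translates into an error budget.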

Course 4 • 4 weeks

Self Healing Architectures

A self-healing architecture is resilient enough to withstand failure and uses automation to resolve issues without human intervention. In this course, you'll gain skills in self-healing architecture design strategies, deployment strategies, and cloud automation.
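As a rough illustration of the self-healing idea, the Python sketch below restarts an unhealthy service until a health check passes, and escalates to a human only after a bounded number of attempts. It is not from the course: `self_heal` and `FlakyService` are hypothetical names, and `FlakyService` is a stand-in for a real workload whose health check and restart would normally be provided by the platform.

```python
from typing import Callable


def self_heal(
    is_healthy: Callable[[], bool],
    restart: Callable[[], None],
    max_restarts: int = 3,
) -> int:
    """Restart until the health check passes; return the number of restarts used.

    Raises RuntimeError if the service is still unhealthy after max_restarts,
    which is the point where automation gives up and a human is paged.
    """
    restarts = 0
    while not is_healthy():
        if restarts >= max_restarts:
            raise RuntimeError("service still unhealthy; escalate to on-call")
        restart()
        restarts += 1
    return restarts


class FlakyService:
    """Stand-in workload that becomes healthy after a fixed number of restarts."""

    def __init__(self, restarts_needed: int) -> None:
        self.restarts_needed = restarts_needed
        self.restart_count = 0

    def is_healthy(self) -> bool:
        return self.restart_count >= self.restarts_needed

    def restart(self) -> None:
        self.restart_count += 1


if __name__ == "__main__":
    service = FlakyService(restarts_needed=2)
    used = self_heal(service.is_healthy, service.restart)
    print(f"recovered after {used} restart(s)")
```

Bounding the number of restarts is a deliberate choice in this sketch: unbounded automatic recovery can hide a persistent fault, so the loop hands off to on-call once the budget of attempts is spent.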

Taught By The Best

Travis Scotto

Site Reliability Engineer

Travis has been working in IT for over 10 years and has been an adjunct instructor for over 5 years. He loves technology and sharing his knowledge with students. Travis draws on his industry experience as an SRE in the classes he teaches, blending that expertise with step-by-step teaching to help students excel. Seeing students succeed is what he likes best.

Emmanuel Apau

CTO of Mechanicode.io

Emmanuel is a co-founder of the Black Code Collective and a recipient of DC's Technical.ly RealLIST Engineer award. An AWS Certified DevSecOps specialist with 12 years of experience, he has spent his career developing innovative solutions using DevSecOps and site reliability best practices.

Sonny Sevin

Site Reliability Engineer

Sonny is an SRE with a varied background. He dabbled in research at Lawrence Berkeley National Labs before moving into site reliability engineering for a more hands-on role. He has been published in several computing journals and has taught introductory programming courses.

Nathan Anderson, MBA

Global Cloud Architect

Nathan is a Certified Six Sigma Black Belt with 10+ years of IT experience across multiple industries. He is also the instructor for two other Udacity courses: Ensuring Quality Releases and Azure Performance.

Ratings & Reviews

Average Rating: 4.5 Stars

(9 Reviews)

The Udacity Difference

Combine technology training for employees with industry experts, mentors, and projects, for critical thinking that pushes innovation. Our proven upskilling system goes after success—relentlessly.

Demonstrate proficiency with practical projects

Projects are based on real-world scenarios and challenges, allowing you to apply the skills you learn to practical situations, while giving you real hands-on experience.

  • Gain proven experience

  • Retain knowledge longer

  • Apply new skills immediately

Top-tier services to ensure learner success

Reviewers provide timely and constructive feedback on your project submissions, highlighting areas of improvement and offering practical tips to enhance your work.

  • Get help from subject matter experts

  • Learn industry best practices

  • Gain valuable insights and improve your skills