Introduction to Data Analysis with Pandas and NumPy

Course

Learn the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. You will work with data in Python using libraries like NumPy and pandas.

3 weeks

Real-world Projects

Completion Certificate

Last Updated August 19, 2023

Prerequisites:

No experience required

Course Lessons

Lesson 1

The Data Analysis Process

Learn about the data analysis process and the Python packages used in this course.

Lesson 2

Jupyter Notebooks

Jupyter Notebooks are a great tool for sharing insights and visualizations alongside your code. This lesson covers how to create them and use their features.

Lesson 3

Exploring and Inspecting Data

Use the pandas library to load data, view its properties, and start asking data analysis questions.

Lesson 4

Manipulating Data using Pandas and NumPy

Use the pandas library to clean, filter, and reshape data. This includes troubleshooting data issues and optimizing for memory usage and speed.

Lesson 5

Communicating Results

Draw conclusions and communicate results to stakeholders by calculating statistics and creating basic data visualizations with the pandas library.

Lesson 6 • Project

Investigate a Dataset

Choose one of Udacity's curated datasets, perform an investigation, and share your findings.
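
To give a feel for the kind of work Lessons 3 through 5 describe, here is a minimal sketch of loading, inspecting, cleaning, and summarizing a small table with pandas and NumPy. It is not taken from the course materials: the sample data, the column names, and the helper functions (`load_and_inspect`, `clean`, `mean_rainfall_by_city`) are all hypothetical and chosen only for illustration.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical sample data standing in for a real CSV file.
CSV_TEXT = """city,year,rainfall_mm
Austin,2021,870.5
Austin,2022,
Boston,2021,1120.0
Boston,2022,1085.3
"""


def load_and_inspect(csv_text):
    """Load CSV text into a DataFrame and collect a few basic properties."""
    df = pd.read_csv(io.StringIO(csv_text))
    summary = {
        "shape": df.shape,
        "columns": list(df.columns),
        "missing": int(df["rainfall_mm"].isna().sum()),
    }
    return df, summary


def clean(df):
    """Drop rows with missing rainfall and store year in a smaller integer type."""
    cleaned = df.dropna(subset=["rainfall_mm"]).copy()
    cleaned["year"] = cleaned["year"].astype(np.int16)
    return cleaned


def mean_rainfall_by_city(df):
    """Compute the mean rainfall for each city."""
    return df.groupby("city")["rainfall_mm"].mean()


df, summary = load_and_inspect(CSV_TEXT)
cleaned = clean(df)
means = mean_rainfall_by_city(cleaned)
print(summary)
print(means)
```

Dropping incomplete rows is only one way to handle missing values; depending on the question being asked, filling them in may be more appropriate. From here, a call such as `means.plot(kind="bar")` would produce a basic chart, assuming matplotlib is installed.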

Taught By The Best

Matt Maybeno

Principal Software Engineer

Matt is a Principal Software Engineer at SOCi. With a master's in Bioinformatics from SDSU, he uses his cross-domain expertise to build solutions in NLP and predictive analytics.
