About this Course

This course will introduce you to the world of data analysis. You'll learn how to go through the entire data analysis process, which includes:

  • Posing a question
  • Wrangling your data into a format you can use and fixing any problems with it
  • Exploring the data, finding patterns in it, and building your intuition about it
  • Drawing conclusions and/or making predictions
  • Communicating your findings

You'll also learn how to use the Python libraries NumPy, Pandas, and Matplotlib to write code that's cleaner, more concise, and runs faster.

This course is part of the Data Analyst Nanodegree.

Course Cost
Free
Timeline
Approx. 6 weeks
Skill Level
Beginner
Included in Course
  • Rich Learning Content

  • Interactive Quizzes

  • Taught by Industry Pros

  • Self-Paced Learning

  • Student Support Community

Join the Path to Greatness

This free course is your first step towards a new career with the Data Analyst Nanodegree Program.

Free Course

Intro to Data Analysis

Enhance your skill set and boost your hirability through innovative, independent learning.

Icon steps 54aa753742d05d598baf005f2bb1b5bb6339a7d544b84089a1eee6acd5a8543d

Course Leads

  • Caroline Buckey
    Caroline Buckey

    Instructor

What You Will Learn

Lesson 1

Data Analysis Process

  • Learn about the data analysis process.
  • Pose a question, wrangle your data, draw conclusions and/or make predictions.
  • Complete an analysis of Udacity student data using pure Python, with few additional libraries.
Lesson 1

Data Analysis Process

  • Learn about the data analysis process.
  • Pose a question, wrangle your data, draw conclusions and/or make predictions.
  • Complete an analysis of Udacity student data using pure Python, with few additional libraries.
Lesson 2

NumPy and Pandas for 1D Data

  • Start learning to use NumPy and Pandas to make the data analysis process easier.
  • Features that apply to one-dimensional data.
  • Learn to use NumPy arrays, Pandas Series, and vectorized operations.
Lesson 2

NumPy and Pandas for 1D Data

  • Start learning to use NumPy and Pandas to make the data analysis process easier.
  • Features that apply to one-dimensional data.
  • Learn to use NumPy arrays, Pandas Series, and vectorized operations.
Lesson 3

NumPy and Pandas for 2D Data

  • Continue learning about NumPy and Pandas, this time focusing on two-dimensional data.
  • Learn to use two-dimensional NumPy arrays and Pandas DataFrames.
  • Group your data and to combine data from multiple files.
Lesson 3

NumPy and Pandas for 2D Data

  • Continue learning about NumPy and Pandas, this time focusing on two-dimensional data.
  • Learn to use two-dimensional NumPy arrays and Pandas DataFrames.
  • Group your data and to combine data from multiple files.
Lesson 4

Investigate a Dataset

  • Use NumPy and Pandas to go through the data analysis process on one of a list of recommended datasets.
Lesson 4

Investigate a Dataset

  • Use NumPy and Pandas to go through the data analysis process on one of a list of recommended datasets.

Prerequisites and Requirements

To take this course, you need to be comfortable programming in Python.

  • You should be familiar with if statements, loops, functions, lists, sets, and dictionaries. To learn about any of these topics, take the course Intro to Computer Science.
  • You should also be familiar with classes, objects, and modules. To learn about these topics, take the course Programming Foundations with Python.

See the Technology Requirements for using Udacity.

Why Take This Course

This course is a good first step towards understanding the data analysis process as a whole. Before delving into each individual phase, it is important to learn the difference between all phases of the process and how they relate to each other. After taking this course, you will be better positioned to succeed in other courses in the Data Analyst Nanodegree program. For example, a student who started with Data Analysis with R, which covers the exploratory data analysis phase, might not understand at that point the difference between data exploration and data wrangling. By taking this course first, you will learn what each phase accomplishes and how it fits into the larger process.

This course also covers the Python libraries NumPy, Pandas, and Matplotlib, which are indispensable tools for doing data analysis in Python. Their many convenient functions and high performance make writing data analysis code a lot easier!

What do I get?
  • Instructor videos
  • Learn by doing exercises
  • Taught by industry professionals
Icon globe e82eae5d45465aba4fbe4bb746905ce55dc3324f310b79c60e4a20089057d347

Udacity 现已提供中文版本! A Udacity tem uma página em português para você! There's a local version of Udacity for you! Sprechen Sie Deutsch?

Besuchen Sie de.udacity.com und entdecken Sie lokale Angebote, unsere Partnerunternehmen und Udacitys deutschsprachigen Blog.

前往优达学城中文网站 Ir para a página brasileira Go to Indian Site Icon flag de deedb1a7a695700236cb6ef4204ddbede5d197dab9b47716c87a0b4d5d9fc325 Zu de.udacity.com continue in English