Data Wrangling

Course

Learn the most efficient methods for gathering raw “dirty” data and cleaning it up for analysis. After finishing this course, you’ll be able to gather messy data from a variety of sources and “tidy” it up in no time using the powerful Python library, pandas.

Enroll Now
  • Estimated time
    1 month

  • Enroll by
    June 14, 2023

    Get access to classroom immediately on enrollment

  • Skills acquired
    Data Preparation, Data Ingestion, Data Cleaning
In collaboration with
  • Kaggle
  • Mode

What You Will Learn

  1. Data Wrangling

    1 month to complete

    Knowing how to “wrangle data” (aka write code that cleans up “dirty” data and shapes it into useful formats) is a universally sought-after skill across data-driven companies. In this course you’ll learn how to leverage the power of Python to quickly gather, assess & clean messy data so it can be explored and analyzed easily.

    Prerequisite knowledge

    Python & SQL.

    1. Intro to Data Wrangling

      Identify each step of the data wrangling process (gathering, assessing, and cleaning) and wrangle a CSV file downloaded from Kaggle using fundamental gathering, assessing, and cleaning code.

    2. Gathering Data

      Gather data from multiple sources: collecting files, downloading files programmatically, scraping data from the web, and accessing data from APIs.

    3. Assessing Data

      Assess data visually and programmatically using pandas, then identify data quality issues and categorize them by metric: validity, accuracy, completeness, consistency, and uniformity.

    4. Cleaning Data

      Identify each step of the data cleaning process (defining, coding, and testing) and clean data using Python and pandas.

    5. Course Project: Wrangle and Analyze Data

      Real-world data rarely comes clean. Using Python, you’ll gather data from a variety of sources, assess its quality and tidiness, then clean it. You’ll document your wrangling efforts in a Jupyter Notebook and showcase them through analyses and visualizations using Python and SQL.
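
To give a feel for the gather, assess, and clean steps described in the lessons above, here is a minimal sketch using pandas. It is not taken from the course materials: the sample data, the column names, and the helper functions `gather`, `assess`, and `clean` are all hypothetical, and an inline string stands in for a CSV file downloaded from a source such as Kaggle.

```python
import io

import pandas as pd

# Hypothetical sample standing in for a downloaded CSV file (an assumption,
# not course data). It contains one missing age, one duplicate row, and
# inconsistent capitalisation in the city column.
RAW_CSV = """name,age,city
Ann,34,Boston
Bob,,boston
Ann,34,Boston
Cara,29,Denver
"""


def gather(text: str) -> pd.DataFrame:
    """Gather: load CSV text into a DataFrame."""
    return pd.read_csv(io.StringIO(text))


def assess(df: pd.DataFrame) -> dict:
    """Assess programmatically: count missing values and duplicate rows."""
    return {
        "missing_values": int(df.isna().sum().sum()),
        "duplicate_rows": int(df.duplicated().sum()),
    }


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Clean, following a define / code / test pattern.

    Define: drop exact duplicate rows, drop rows with no age,
    store age as an integer, and capitalise city names consistently.
    """
    # Code: apply each step from the definition above.
    cleaned = df.drop_duplicates().dropna(subset=["age"]).copy()
    cleaned["age"] = cleaned["age"].astype(int)
    cleaned["city"] = cleaned["city"].str.title()
    return cleaned.reset_index(drop=True)


raw = gather(RAW_CSV)
report = assess(raw)
tidy = clean(raw)

# Test: re-run the assessment on the cleaned data to confirm the issues are gone.
assert assess(tidy) == {"missing_values": 0, "duplicate_rows": 0}
```

Dropping rows with a missing age is only one possible choice; depending on the analysis, filling the gap or keeping the row may be more appropriate.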

            All Our Courses Include

            • Real-world projects from industry experts

              With real-world projects and immersive content built in partnership with top-tier companies, you’ll master the tech skills companies want.

            • Real-time support

              On-demand help. Receive instant help with your learning directly in the classroom. Stay on track and get unstuck.

            • Workspaces

              Validate your understanding of concepts learned by checking the output and quality of your code in real time.

            • Flexible learning program

              Tailor a learning plan that fits your busy life. Learn at your own pace and reach your personal goals on the schedule that works best for you.

            Course offerings

            • Class content

              • Real-world projects
              • Project reviews
              • Project feedback from experienced reviewers
            • Student services

              • Student community
              • Real-time support

            Succeed with personalized services.

            We provide services customized for your needs at every step of your learning journey to ensure your success.

            Get timely feedback on your projects.

            • Personalized feedback
            • Unlimited submissions and feedback loops
            • Practical tips and industry best practices
            • Additional suggested resources to improve
            • 1,400+ project reviewers
            • 2.7M projects reviewed
            • 88/100 reviewer rating
            • 1.1 hours average project review turnaround time

            Learn with the best.


            • Josh Bernhard

              Data Scientist at NerdWallet

              Josh has been sharing his passion for data for nearly a decade at all levels of university, and as Lead Data Science Instructor at Galvanize. He's used data science for work ranging from cancer research to process automation.

            • David Venturi

              Data Analyst Instructor

              Formerly a chemical engineer and data analyst, David created a personalized data science master's program using online resources. He has studied hundreds of online courses and is excited to bring the best to Udacity students.

            • Sam Nelson

              Product Lead

              Sam is the Product Lead for Udacity’s Data Analyst, Business Analyst, and Data Foundations programs. He’s worked as an analytics consultant on projects in several industries, and is passionate about helping others improve their data skills.

            Data Wrangling

            Get started today

              • Learn

                How to quickly gather, assess, and clean data from multiple sources using Python’s popular pandas library.

              • Average Time

                On average, successful students take 1 month to complete this program.

              • Benefits include

                • Real-world projects from industry experts
                • Real-time support

              Program Details

              • Do I need to apply? What are the admission criteria?

                No. This course accepts all applicants regardless of experience or background.

              • What are the prerequisites for enrollment?

                In order to succeed in this program, we recommend having experience working with SQL and with data in Python, ideally with the NumPy and/or pandas libraries.

              • How is this course structured?

                The Data Wrangling course comprises content and curriculum that support one project. We estimate that students can complete the program in 1 month.

                The project will be reviewed by the Udacity reviewer network and platform. Feedback will be provided, and if your project does not pass, you will be asked to resubmit it until it does.

              • How long is this course?

                Access to this course runs for the length of time specified in the payment card above. If you do not graduate within that time period, you will continue learning with month-to-month payments. See the Terms of Use and FAQs for other policies regarding the terms of access to our programs.

              • Can I switch my start date? Can I get a refund?

                Please see the Udacity Program Terms of Use and FAQs for policies on enrollment in our programs.

              • What software and versions will I need in this course?

                You will need access to the Internet and a 64-bit computer. Additional software such as Python and its common data analysis libraries (e.g., NumPy and pandas) will be required, but the program will guide students on how to download them once the course has begun.
