About this Course

Data science plays an important role in many industries. In facing massive amount of heterogeneous data, scalable machine learning and data mining algorithms and systems become extremely important for data scientists. The growth of volume, complexity and speed in data drives the need for scalable data analytic algorithms and systems. In this course, we study such algorithms and systems in the context of healthcare applications.

In healthcare, large amounts of heterogeneous medical data have become available in various healthcare organizations (payers, providers, pharmaceuticals). This data could be an enabling resource for deriving insights for improving care delivery and reducing waste. The enormity and complexity of these datasets present great challenges in analyses and subsequent applications to a practical clinical environment.

Course Cost
Free
Timeline
Approx. 0
Skill Level
Intermediate
Included in Course
  • Rich Learning Content

  • Interactive Quizzes

  • Taught by Industry Pros

  • Self-Paced Learning

  • Student Support Community

Join the Path to Greatness

This free course is your first step towards a new career with the Machine Learning Engineer Nanodegree Program.

Free Course

Big Data Analytics in Healthcare

by Georgia Institute of Technology

Enhance your skill set and boost your hirability through innovative, independent learning.

Icon steps 54aa753742d05d598baf005f2bb1b5bb6339a7d544b84089a1eee6acd5a8543d

Course Leads

  • Jimeng Sun
    Jimeng Sun

    Instructor

  • David Joyner
    David Joyner

    Instructor

What You Will Learn

Lesson 1

Big Data

  • Predictive Modeling
  • Dimensionality Reduction & Tensor Factorization
  • Graph Analysis
Lesson 1

Big Data

  • Predictive Modeling
  • Dimensionality Reduction & Tensor Factorization
  • Graph Analysis
Lesson 2

Healthcare

  • Computational Phenotyping
  • Patient Similarity Metrics
  • Medical Ontology
Lesson 2

Healthcare

  • Computational Phenotyping
  • Patient Similarity Metrics
  • Medical Ontology
Lesson 3

Technologies

  • MapReduce
  • Spark
  • Hadoop
Lesson 3

Technologies

  • MapReduce
  • Spark
  • Hadoop

Prerequisites and Requirements

Basic machine learning and data mining concepts such as classification and clustering;

Proficient programming and system skills in Python, Java and Scala;

Proficient knowledge and experience in dealing with data (recommended skills include SQL, NoSQL such as MongoDB).

See the Technology Requirements for using Udacity.

Why Take This Course

In this course, we introduce the characteristics of medical data and associated data mining challenges on dealing with such data. We cover various algorithms and systems for big data analytics. We focus on studying those big data techniques in the context of concrete healthcare analytic applications such as predictive modeling, computational phenotyping and patient similarity. We also study big data analytic technology:

Scalable machine learning algorithms such as online learning and fast similarity search;

Big data analytic system such as Hadoop family (Hive, Pig, HBase), Spark and Graph DB

What do I get?
  • Instructor videos
  • Learn by doing exercises
  • Taught by industry professionals
Icon globe e82eae5d45465aba4fbe4bb746905ce55dc3324f310b79c60e4a20089057d347

Udacity 现已提供中文版本! A Udacity tem uma página em português para você! There's a local version of Udacity for you! Sprechen Sie Deutsch?

Besuchen Sie de.udacity.com und entdecken Sie lokale Angebote, unsere Partnerunternehmen und Udacitys deutschsprachigen Blog.

前往优达学城中文网站 Ir para a página brasileira Go to Indian Site Icon flag de deedb1a7a695700236cb6ef4204ddbede5d197dab9b47716c87a0b4d5d9fc325 Zu de.udacity.com continue in English