Module 7: Exploring Two-Dimensional Data

This modules extends what you have learned in previous modules to the visual and analytic exploration of two-dimensional data. First, you will learn how to make two-dimensional scatter plots in Python and how they can be used to graphically identify a correlation and outlier points. Second, you will learn how to work with two-dimensional data by using the NumPy module, including a discussion on analytically quantifying correlations in data. Third, you will read about statistical issues that can impact understanding multi-dimensional data, which will allow you to avoid them in the future. Finally, you will learn about ordinary linear regression and how this technique can be used to model the relationship between two variables.

Learning Objectives

By the end of this module, you should be able to:

  • display and interpret two-dimensional data by using Python,
  • work with multi-dimensional arrays in NumPy,
  • quantify correlations between data by using Python,
  • understand statistical concerns when working with multi-dimensional data, and
  • compute and interpret linear models.

Activities and Assignments

Activities and Assignments Time Estimate Deadline Points
Module 7 Overview Video 10 Minutes N/A N/A
Module 7 Lesson 1: Introduction to Scatter Plots 1 Hour N/A N/A
Module 7 Lesson 2: Introduction to NumPy Matrices 2 Hours N/A N/A
Module 7 Lesson 3: Introduction to Statistical Issues 1 Hour N/A N/A
Module 7 Lesson 4: Introduction to Ordinary Linear Regression 2 Hours N/A N/A
Module 7 Assignment 2 Hours N/A N/A

© 2017: Robert J. Brunner at the University of Illinois.

This notebook is released under the Creative Commons license CC BY-NC-SA 4.0. Any reproduction, adaptation, distribution, dissemination or making available of this notebook for commercial use is not allowed unless authorized in writing by the copyright holder.