Module 3: Introduction to Data Analysis

This module introduces fundamental concepts in data analysis. First, you will read a report from the Association of Accountants and Financial Professionals in Business that explores Big Data in Accountancy and read a blog article that discusses how a programming language like Python offers specific advantages over performing data analytics in Excel. Next, you will learn about the Unix file system, which is the operating system used for most big data processing (as well as Linux and Mac OSX desktops and many mobile phones). Second, you will learn how to read and write data to a file from within a Python program. Finally, you will learn about the Pandas Python module that can simplify many challenging data analysis tasks, and includes the DataFrame, which programmatically mimics many of the features of a traditional spreadsheet.

Learning Objectives

By the end of this module, you should be able to:

  • work with the Unix file system and understand the concept of directories and files,
  • read and write data to files from within a Python script, and
  • use a Pandas Series and DataFrame to manipulate data from within a Python script.

Activities and Assignments

Activities and Assignments Time Estimate Deadline Points
Module 3 Overview Video 10 Minutes N/A N/A
Module 3 Lesson 1: Coding and Accountancy 1 Hour N/A N/A
Module 3 Lesson 2: Introduction to Unix 1 Hour N/A N/A
Module 3 Lesson 3: Python File I/O 1 Hour N/A N/A
Module 3 Lesson 4: Introduction to Pandas 2 Hours N/A N/A
Module 3 Assignment 1 hour N/A N/A

*Please note that unless otherwise noted, the due time is 6pm Central time!


© 2017: Robert J. Brunner at the University of Illinois.

This notebook is released under the Creative Commons license CC BY-NC-SA 4.0. Any reproduction, adaptation, distribution, dissemination or making available of this notebook for commercial use is not allowed unless authorized in writing by the copyright holder.