The Use of Computation and Computational Techniques for Data Analysis

by Michelle L. Isenhour, PhD

mlisenho@nps.edu

After reading all of the participant essays, what word do you think was the most commonly used word?

Background

As a service course for other departments, I teach basic (undergraduate) statistics and data analysis to students at the graduate level.

The purpose of my course is to prepare students for computational data analysis they will see in their downstream courses, as well as in their future careers as military officers and US Department of Defense civilians. 

Computational Learning Goals

  • At the completion of the course, students will be able to:
    • Acquire data, perform basic data cleaning, and transform variables to facilitate analysis.
    • Perform exploratory and inferential methods for analyzing data and apply the methods in real-world contexts.
    • Formulate simple algorithms to solve problems and code them using statistical software.
    • Fit and assess the adequacy of data-based models using statistical software.

Modules

Students are exposed to the following topics:        

  1. Exploratory Data Analysis: Plots and Descriptive Statistics
  2. Statistical Inference: Parameter Estimation
  3. Statistical Inference: Hypothesis Testing
  4. Statistical Inference: Analysis of Variance
  5. Statistical Inference: Regression Analysis

The "Evolution"

Last year was my first year teaching with MATLAB.

Incorporated MATLAB Live Scripts and MATLAB Grader.

This year, migrated towards self-paced materials.

Incorporating MATLAB

  • MATLAB Live Script (.mlx)                                                    
    • Incorporate lesson material with code examples

Incorporating MATLAB

  • MATLAB Script (.m)                                                               
    • Code examples with comments

Incorporating MATLAB

  • MATLAB Grader                                                                     
    • Instant feedback on student coding attempts

MATLAB Live Scripts

  • MATLAB Live Scripts (.mlx)                                                                           
    • Confidence Intervals for Mean
    • Confidence Intervals for Variance
    • Confidence Intervals for Population Proportion
    • One Sample Hypothesis Tests
    • Two Sample Hypothesis Tests
    • Paired Sample Hypothesis Tests
    • Measures of Linear Relationships
    • Simple Linear Regression
    • More on Simple Linear Regression
    • Modeling with Simple Linear Regression
    • Multiple Linear Regression
    • Multiple Linear Regression with Categorical Variables
    • Using Regression Models to Make Predictions
    • Assessing Model Adequacy
    • Regression with Transformed Variables
    • Logistic Regression
    • Analysis of Categorical Data

Teaching Activity

MATLAB Resources

Linear Regression:

  1. Interpret Linear Regression Results     > Documentation
  2. Linear Regression     > Documentation 
  3. Linear Regression with Interaction Effects      > Documentation
  4. Summary of Output and Diagnostic Statistics    > Documentation     
  5. F-statistic and t-statistic    > Documentation  
  6. Coefficient of Determination (R-Squared)    > Documentation  
  7. Coefficient Standard Errors and Confidence Intervals   > Documentation                      
  8. Residuals   > Documentation  
  9. Generalized Linear Models  > Documentation 

Hypothesis Testing:

  1. One-Sample z-Test      > Documentation 
  2. One-Sample t-Test      > Documentation 
  3. Two-Sample t-Test      > Documentation
  4. Sample Size and Power for Hypothesis Tests   > Documentation     > Example             

Analysis of Variance:          

  1. Analysis of Variance and Covariance      > Documentation                                                 
  2. anova1      > Documentation 
  3. anova2     > Documentation
  4. anovan     > Documentation     
  5. multcompare   > Documentation  

Pre-Class Self-Assessment

  • Pre-class student self-assessment:
    • Read/review the material and attempt a pre-class assignment.
    • Review the solution, along with a MATLAB script file (or Live Script).
    • Identify deficiencies prior to attending the class lecture.
  • Pre-class self-assessment via individual student blog:
    • Map the content to real-world examples and explain how they would utilize the techniques.
    • Answer specific questions to assess understanding.

Post-Class Assessment

  • Post-class computer-based assessment:
    • Retake the computational assessment at the end of each lesson an unlimited number of times.
    • Open book, open notes and the use of any statistical software package is authorized.
    • Immediate feedback provided to the student.
  • Post-module laboratory exercise:
    • Hands-on computational assessment, beginning in class and continuing out-of-class.
    • Culminates in a two-page report, executive summary, or web page.
  • Post-course final exam:
    • Two week out-of-class (take home) assessment.
    • Approximately 30 questions, 20 True/False and Multiple Choice and 10 computational problems.

Questions?

Michelle L. Isenhour, PhD

mlisenho@nps.edu

Linked In: https://www.linkedin.com/in/michelle-isenhour

The Use of Computation and Computational Techniques for Data Analysis

By Michelle L. Isenhour

The Use of Computation and Computational Techniques for Data Analysis

This presentation, prepared for the 2019 Teaching Computation in the Sciences Using MATLAB workshop, describes how computation and computational techniques are incorporated in a graduate-level course on data analysis at the Naval Postgraduate School.

  • 1,109