Introduction to Understanding and Handling Missing Data: From Descriptives to Replacement

Hosted by the Stigma and Resilience Among Vulnerable Youth Centre (SARAVYC)

Date: 19 Sep 201721 Sep 2017

Presented by: Lauree Tilton-Weaver, Ph.D., Professor of Psychology Örebro University, Sweden

Room: UBC Hospital

Workshop Summary: This SARAVYC workshop focuses on understanding missing data and learning principled methods of replacing missing data. The workshop includes understanding how to describe missing data (amount, patterns, mechanisms, newer assessments) and how to deal with missing data. We will cover proactive methods (planned missingness) and reactive methods for handling missing data. More coverage of two modern methods—maximum likelihood estimation and multiple imputation—will be presented in more depth, with an introduction to R packages that can be used to maximize these methods, including derivation of principled components to be used as auxiliary variables for estimation and imputation. Participants are encouraged to bring their own computers and data, for exploring these methods in their own studies.

Note: An understanding of basic to intermediate statistics is assumed (measurement levels, descriptive and inferential statistics, basic understanding of data reduction techniques and linear models). Basic understanding of R is useful, but not required. Prior to the course, load R and Rstudio on your computer, so that you can practice the tutorials and explore your own data.

Registration limited. RSVP for registration, UBC room locations & recommended readings to: Monica Shannon: mshannon93@alumni.ubc.ca

TENTATIVE SCHEDULE:

Day 1: Introduction (lecture and discussions):  (9-12; 1:30 to 3:30)

  • The importance of handling missing data
  • Descriptives: amounts and patterns
  • Mechanisms of missingness – and attempting to identify them
  • Proactive methods: planned missingness
  • Reactive methods: Traditional versus modern methods
  • Descriptions of maximum likelihood estimation and multiple imputation
  • Description of the principle component method for auxiliary variables

Day 2: The PCA method of auxiliary variables (9-12; 1:30 to 5)

  • Verbal walkthrough, R walkthrough, practice tutorials
  • Verbal walkthrough of multiple imputations using R, practice with R tutorials
  • Pooling estimates from multiple imputation – practice

Day 3: Individual practice—on tutorials or own data, with guidance from Tilton-Weaver (9-12; 1:30 to 5)