Fake It Until You Make It

How and why to simulate research data

Lisa DeBruine

Abstract

Being able to simulate data allows you to prep analysis scripts for pre-registration, calculate power and sensitivity for analyses that don’t have empirical methods, create reproducible examples when your data are too big or confidential to share, enhance your understanding of statistical concepts, and create demo data for teaching and tutorials. This workshop will cover the basics of simulation. We will simulate data with factorial designs by specifying the within- and between-subjects factor structure, each cell mean and standard deviation, and correlations between cells where appropriate. These simulated data sets can be used to prepare the analysis code for pre-registrations or registered reports. We will also create data sets for simulation-based power analyses.
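To make the idea of specifying cell means, standard deviations, and correlations concrete, here is a minimal sketch in plain Python. It is not the faux API. The function names (sim_within_pair, pearson), the cell labels A and B, and all parameter values are hypothetical and chosen only for illustration. It simulates one within-subject factor with two cells by mixing two independent standard normal draws so that the cells share a target correlation.

```python
import math
import random
import statistics


def sim_within_pair(n, mu, sd, r, seed=None):
    """Simulate n subjects measured in two within-subject cells.

    mu and sd are (cell_A, cell_B) tuples; r is the target population
    correlation between the two cells. All names are illustrative only.
    """
    rng = random.Random(seed)  # a seeded generator keeps the example reproducible
    rows = []
    for subject in range(1, n + 1):
        z1 = rng.gauss(0, 1)
        z2 = rng.gauss(0, 1)
        # Cell B reuses z1 with weight r, which induces correlation r with cell A.
        a = mu[0] + sd[0] * z1
        b = mu[1] + sd[1] * (r * z1 + math.sqrt(1 - r ** 2) * z2)
        rows.append({"id": subject, "A": a, "B": b})
    return rows


def pearson(x, y):
    """Sample Pearson correlation, written out to avoid extra dependencies."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical design: cell means 100 and 105, SD 15 in both cells, r = 0.5.
data = sim_within_pair(n=10000, mu=(100, 105), sd=(15, 15), r=0.5, seed=8675309)
a = [row["A"] for row in data]
b = [row["B"] for row in data]

print("mean A:", round(statistics.fmean(a), 1))
print("mean B:", round(statistics.fmean(b), 1))
print("r(A, B):", round(pearson(a, b), 2))
```

With a large n the sample means, standard deviations, and correlation should land close to the values requested, which is a quick check that the simulated data match the intended design before any analysis code is written against them.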

Why Simulate Data?

Pre-Registration

Prep analysis scripts for pre-registration

Power

Calculate power and sensitivity for analyses that don’t have empirical methods
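A simulation-based power calculation can be sketched as follows: simulate many data sets under an assumed effect, run the planned test on each, and report the proportion of significant results. This is a minimal illustration in plain Python, not the workshop's own code. The function name simulate_power and all parameter values are hypothetical. To stay dependency-free it compares two independent groups using a normal approximation to the t-test, which is only reasonable for moderately large groups.

```python
import math
import random
import statistics


def simulate_power(n_per_group, mean_diff, sd, alpha=0.05, reps=2000, seed=None):
    """Estimate power for a two-group comparison by repeated simulation.

    Assumption: the test statistic is compared against a normal critical
    value (an approximation to the t-test, used here to avoid dependencies).
    """
    rng = random.Random(seed)
    crit = statistics.NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value
    significant = 0
    for _ in range(reps):
        g1 = [rng.gauss(0, sd) for _ in range(n_per_group)]
        g2 = [rng.gauss(mean_diff, sd) for _ in range(n_per_group)]
        # Standard error of the difference in means (unpooled variances).
        se = math.sqrt(
            statistics.variance(g1) / n_per_group
            + statistics.variance(g2) / n_per_group
        )
        z = (statistics.fmean(g2) - statistics.fmean(g1)) / se
        if abs(z) > crit:
            significant += 1
    return significant / reps


# Hypothetical scenario: 64 subjects per group, true difference of 0.5 SD.
power = simulate_power(n_per_group=64, mean_diff=0.5, sd=1.0, seed=1)
# With no true effect, the same procedure estimates the false positive rate.
null_rate = simulate_power(n_per_group=64, mean_diff=0.0, sd=1.0, seed=2)

print("estimated power:", power)
print("estimated false positive rate:", null_rate)
```

The same loop works for analyses with no closed-form power formula: swap the data-generating step and the test for whatever design and model are planned, and keep counting significant results.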

Reproducible Examples

Create reproducible examples when your data are too big or confidential to share

Enhance Understanding

Enhance your understanding of statistical concepts

Teaching Data

Create demo data for teaching and tutorials

Faux

Web App: rstudio-connect.psy.gla.ac.uk/faux/

Plot

Data

Code

Further Resources

PsyPag Simulation Summer School

Data Simulation Workshops

Exercises

To install Faux.jl in Julia, type ] at the Julia REPL to enter package mode, then run: add https://github.com/debruine/Faux.jl