Bootstrapping data and fits for parameter uncertainty estimation

Code

library(tidyverse)
library(targets)
library(GGally)
library(kableExtra)

# load "R/*" scripts and saved R objects from the targets pipeline
tar_source()
tar_load(c(exp1_data, exp2_data, exp1_data_agg, exp2_data_agg, exp3_data_agg))
set.seed(213)

# make sure that if the model fails to converge, it doesn't stop the whole process
safe_boot_est <- purrr::safely(boot_est)

Overview

To get an estimate of the uncertainty in the parameter estimates, we can use bootstrapping. This involves resampling the data with replacement and fitting the model to each resampled dataset. We do this:

Resample the IDs of participants in the dataset with replacement.
For each subject, calculate the proportion of correct responses in each condtion and resample the observed counts from a binomial distribution with the probability of success equal to the proportion of correct responses.
Estimate the model parameters from the resampled data.
Repeat steps 1-3 1000 times to get a distribution of parameter estimates.

Experiment 1

For now I’ve done it just for experiment 1. Here are the results:

Code

res_asy <- run_or_load(
  do.call(rbind, replicate(1000, safe_boot_est(exp1_data)$result)),
  "output/res_boot1000_asy.rds")

plot_bootstrap_results(res_asy)

and here are the 95% highest density intervals for the rate parameters

Code

round(HDInterval::hdi(res_asy$rate), 3)

lower upper 
0.056 0.213 
attr(,"credMass")
[1] 0.95

from these bootstrap estimates, we can calculate how long it would take for 50% of the resources to recover from 0 (also see here).

Code

t_est <- -log(0.5) / res_asy$rate

round(HDInterval::hdi(t_est), 3)

 lower  upper 
 2.799 10.676 
attr(,"credMass")
[1] 0.95

It would take on average \(6.1883827\) seconds for 50% of the resources to recover from 0, with a 95% HDI of \(2.7987571, 10.67613\).