Hi Donald,
I’m using the Copula model in explore to estimate CD & CI graphs, with impute set to TRUE. I’m then using the posterior_predict function to draw samples from the PPD, & the predicted_probability function to estimate conditional probabilities (& 95% intervals) of the presence of a health condition given scores on items that were conditionally dependently associated in the copula GGM.
I’m also using the PPD samples to perform qualitative Post. Predictive Checks (PPC) by using the bayes plot package (ppc_bars) to compare the replicated data to the empirical, as well as looking at how well model simulated data reproduces item skewness.
ppc_bars however only accepts datasets of the same length, so what I’ve done at the moment is remove the entries corresponding to rows with missing data & compared the empirical to PPD samples with missing data rows removed - but I can't find a lot of information on how to best approach posterior predictive checks when data has been imputed, & as I'm not a statistician I'm not convinced this current approach is remotely legitimate.
I was therefore wondering if there is a way to get the imputed dataset from the explore object so that the complete data set (observed + imputed) can be compared to the replicate data sets sampled from the PPD?
Many thanks in advance.
Tom
Hi Donald,
I’m using the Copula model in explore to estimate CD & CI graphs, with impute set to TRUE. I’m then using the posterior_predict function to draw samples from the PPD, & the predicted_probability function to estimate conditional probabilities (& 95% intervals) of the presence of a health condition given scores on items that were conditionally dependently associated in the copula GGM.
I’m also using the PPD samples to perform qualitative Post. Predictive Checks (PPC) by using the bayes plot package (ppc_bars) to compare the replicated data to the empirical, as well as looking at how well model simulated data reproduces item skewness.
ppc_bars however only accepts datasets of the same length, so what I’ve done at the moment is remove the entries corresponding to rows with missing data & compared the empirical to PPD samples with missing data rows removed - but I can't find a lot of information on how to best approach posterior predictive checks when data has been imputed, & as I'm not a statistician I'm not convinced this current approach is remotely legitimate.
I was therefore wondering if there is a way to get the imputed dataset from the explore object so that the complete data set (observed + imputed) can be compared to the replicate data sets sampled from the PPD?
Many thanks in advance.
Tom