I have a question. If nmc=100000, thin=10 and outpred=outpred1, does this mean I will get a dataset named outpred1 that contains 10000 observations? Thanks.

@jwb133 wrote:

After discussing with SAS support, the following is my understanding of how preddist works. Suppose PROC MCMC is called with NMC=x, so that x iterations of the MCMC sampler are performed. Suppose also that NSIM=y is specified, requesting y draws from the posterior predictive distribution of the outcome. To produce these y draws, PROC MCMC samples y parameter values, with replacement, from the NMC=x samples from the posterior distribution. For each sampled parameter value, it then simulates a value of the outcome from its distribution conditional on that parameter value.

This approach would seem to be valid if the y parameter values drawn with replacement from the NMC=x iterations are i.i.d. draws from the posterior. However, it seems that this may not hold. Suppose, for example, that we (perhaps stupidly) choose NMC=10 and NSIM=100000. Then every simulated outcome is drawn conditional on one of only 10 parameter values. These 10 parameter values are probably correlated to some extent, and moreover many draws of the outcome are made conditional on the same parameter value. In this (perhaps contrived) scenario, the 100000 values would not be (I contend) valid draws from the posterior predictive distribution.
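To make the NMC=10, NSIM=100000 scenario concrete, here is a rough Python sketch of the resampling scheme as described above. It is not SAS code and not PROC MCMC's actual implementation. The function name, the normal outcome model with known sigma, and the stand-in "posterior draws" are all assumptions made purely for illustration.

```python
import random

def predictive_by_resampling(posterior_draws, nsim, sigma=1.0, seed=0):
    """Hypothetical helper: draw nsim outcomes by resampling parameter
    values with replacement, then simulating one outcome per value.
    Assumes (for illustration only) outcome ~ Normal(mu, sigma)."""
    rng = random.Random(seed)
    outcomes, used = [], []
    for _ in range(nsim):
        idx = rng.randrange(len(posterior_draws))  # sample with replacement
        mu = posterior_draws[idx]
        used.append(idx)
        outcomes.append(rng.gauss(mu, sigma))      # outcome given that mu
    return outcomes, used

nmc, nsim = 10, 100_000

# Stand-in for the NMC saved MCMC draws of a mean parameter
# (made-up values, not output from a real sampler).
draw_rng = random.Random(1)
posterior_draws = [draw_rng.gauss(0.0, 1.0) for _ in range(nmc)]

outcomes, used = predictive_by_resampling(posterior_draws, nsim)

distinct_params = len(set(used))
max_reuse = max(used.count(i) for i in range(nmc))
print("predictive draws:", len(outcomes))
print("distinct parameter values behind them:", distinct_params)
print("most-reused parameter value was used", max_reuse, "times")
```

However large NSIM is made, the outcomes are only ever conditioned on at most NMC distinct parameter values, which is the concern raised in the quoted post.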