BookmarkSubscribeRSS Feed
H
Pyrite | Level 9 H
Pyrite | Level 9

I just got the "MI of missing data with SAS" book last night. I ran code from section 6.3 (for arbitrary missing data, categorical > 2 groups, using FCS discrim).

Everything worked fine. I have the imputed data from all of the iterations,.., etc., but what do I use in the original dataset. Is there a way to kick out the original dataset with these data inserted into it?

3 REPLIES 3
ballardw
Super User

Most of us likely don't have the book. It would help to show the code you did run to generate the output.

H
Pyrite | Level 9 H
Pyrite | Level 9

Thanks ballardw.

I am now under the impression that you will not typically get a single imputed dataset (with the pooled values). Instead you get the n imputed datasets and a pulled estimate of those datasets. For my example,  that was the percentages in each of the groups for the missing categorical variable. I think that is the answer to my question.

Other scenarios where the purpose may be inferential statistics, you would run the procedure with all of the datasets and get a pooled estimate for the statistic.

If there are any other possibilities people know of I would appreciate hearing their thoughts!

Rick_SAS
SAS Super FREQ

As you figured out, PROC MI creates stacked versions of the original data, where the various replicates are indicated by distinct values of the _IMPUTATION_ variable.  You use the BY _IMPUTATION_ statement in your favorite procedure to analyze each imputed value, then use PROC MIANALYZE to aggregate the results. PROC MIANALYZE produces a point estimate for your statistic and a standard error that corrects for the added variance due to imputation.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 1381 views
  • 0 likes
  • 3 in conversation