BookmarkSubscribeRSS Feed
Yang2018
Calcite | Level 5

Hello,

 

 

I have a question related to the topic of multiple imputation. I am hoping that you can help me out.

I am trying to impute missing values in a dataset that has mostly categorical variables (binary, ordinal or nominal) and one continuous variable (age). I am using the discriminant FCS method for missing categorical variables (total 😎 and regpmm for one missing  age (1). The other variables have no missing value.

 

I am doing this in SAS 9.4 (SAS/Stat 14.1) and only set imputation number as 1, I tried to get the imputed data set quickly. This below code can be run without error information, however, it can't give me the final imputed data set, even after 1 day run. Did I include too many missing variables (total 9) in this model? Why it runs but no output after long time waiting, even I just set nimpute=1? What further model modification I can do to get final imputed data set? Thanks. 

 

Here is the example code I used:

 

proc mi data=OUTPT.analysis MU0=35 minimum=0 maximum=108 round=1 nimpute=1 out=OUTPT.imputed;

class SEX_C RACETH_C D_C P_C I_C BD_C DI_C LO_C TY_C IN_C FM_C PS_C ST_C SP_C SC_C PR_C HO_C;

fcs regpmm(AGEYR_N/details);

fcs discrim(RACETH_C/details classeffects=include);

fcs discrim(SEX_C/details classeffects=include);

fcs discrim(P_C/details classeffects=include);

fcs discrim(I_C/details classeffects=include);

fcs discrim(BD_C/details classeffects=include);

fcs discrim(DI_C/details classeffects=include);

fcs discrim(LO_C/details classeffects=include);

fcs discrim(TY_C/details classeffects=include);

var AGEYR_N SEX_C RACETH_C D_C P_C I_C BD_C DI_C LO_C TY_C IN_C FM_C PS_C ST_C SP_C SC_C PR_C HO_C TRE_N;

run;

2 REPLIES 2
Yang2018
Calcite | Level 5

Is there anybody who can help to solve this issue? Why the code can be run but no final imputed data set is available?  Thanks.

SAS_Rob
SAS Employee

Are there any messages in the LOG?  How large is the input data set?  How many levels are there to the categorical variables?

 

I would probably start by reducing the size of the models, especially if there are lot of levels to the categorical variables.

 

Instead of using all the categorical variables for the DISCRIM models, what happens when you use only a few predictors in each model.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 2284 views
  • 0 likes
  • 2 in conversation