Hello,
I am running a logistic regression with four predictor variables, 1 outcome variable, and 6 covariates. My N is around 2000 participants.
For one of my four predictor variables, there are around 1000 people missing the question. The question asked was very sensitive and it is normal for people to skip it on surveys per other papers. The situation is similar for one of my covariates, where a question on a sensitive topic is also skipped by a substantial number of people.
Right now when I run it, my data set ends up cut in half. My major professor will not let me do multiple imputations and wants me to do Full Information Maximum Likelihood to account for the missing data.
Does anyone have any advice on if I can do this in SAS?
Thank you!
Some people will fill in the mean of the data in place of the missings and also create a new variable which is 0 if the original variable was not missing and 1 if the original variable was missing. Then you put the original variable and the new variable into the model. This essentially predicts the observations with the original variable missing at a constant offset from the observations when the original variable was not missing.
Do you want impute missing value by EM algorithm ?
@Rick_SAS wrote a blog about it recently .
The SAS documentation has an example of how to compute FIML by using structural equation modeling with PROC CALIS.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
SAS' Charu Shankar shares her PROC SQL expertise by showing you how to master the WHERE clause using real winter weather data.
Find more tutorials on the SAS Users YouTube channel.