I'm trying to run a model using Proc Logistic but I get an error message saying "no valid observations". I read that including a variable whose values are 100% missing in a proc logistic model results in this error. I went back and found that some of my variables did indeed have values that were 95 - 100% missing, so I took them out of the model but I'm still getting this error message. Can someone tell me any other reasons as to what might be causing this??
I am not sure if this is something that you have thought about but it may be possible that people with answers to one of your indicator variables may not have responses to another variable. This was the case in one of my examinations where people who responded to one question in my survey often skipped a question later on.
This may look like what you have already said, but when a large number of variables are used as indicators, it is possible for each variable to have a high response rate yet still get the error message. One requirement for an observation to be included in the logistic regression is that it must have a complete response to ALL variables in the model. Take the below as an example.
proc logistic data=one;
model result=age height weight BMI WHR;
Although this is a small dataset it outlines that while no individual variable has lower than an 80% response rate, because each observation has a missing for at least one of the model variables I get the error message you mentioned.
You might want to think about doing some investigation into imputation, where the responses you do know can help you impute the missing values and you can still utalize the variables in the model.