I used the following LOGISTIC procedure to predict the variable group, but I cannot understand the output. The output shows that both levels age='4' and age='X' have DF=0 and an estimate of 0. While age='X' is understandably the reference group, the result for the age='4' group is questionable.

I checked the data using PROC FREQ and found no such problem as quasi-complete separation. Age and sex are not perfectly related. The only problem may come from the fact that if sex is gay (sex='G'), then age is always unknown (age='X'). Apart from this situation, there is no relationship between sex and age.

I tried again after combining age='4' into the age='3' group (i.e. the levels of age are 1, 2, 3 and X) and ran the model again. This time, the level age='3' has the same problem, with DF=0 and Estimate=0.

PROC LOGISTIC DATA=work.data DESC;
  CLASS sex age;
  MODEL group=sex age / AGGREGATE SCALE=NONE; /* To model group=0 or 1 */
RUN;

Analysis of Maximum Likelihood Estimates

Parameter    DF  Estimate  Standard Error  Wald Chi-Square  Pr > ChiSq
Intercept     1   -8.5254          0.0925        8502.1673      <.0001
sex       F   1    0.0686          0.0760           0.8144      0.3668
sex       G   1    0.1554          0.1115           1.9435      0.1633
sex       M   0         0               .                .           .
age       1   1    0.2709          0.1690           2.5692      0.1090
age       2   1    0.0275          0.1461           0.0354      0.8507
age       3   1   -0.2961          0.1285           5.3097      0.0212
age       4   0         0               .                .           .
age       X   0         0               .                .           .
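
A minimal sketch of the kind of PROC FREQ cross-tabulation check described above, assuming the same work.data dataset and the variables sex, age and group from the model. The table options are illustrative only and are not necessarily the ones originally used. Empty cells in the sex*age table (for example, no age='X' rows outside sex='G') would point to a linear dependency between the two CLASS variables rather than to separation.

```sas
/* Illustrative check only: dataset and variable names are taken from the
   PROC LOGISTIC step above; the options are an assumption. */
PROC FREQ DATA=work.data;
  /* Cell counts for sex by age: look for rows or columns with zero cells */
  TABLES sex*age / NOROW NOCOL NOPERCENT;
  /* Cell counts of the response against each predictor, to look for
     levels in which only group=0 or only group=1 occurs */
  TABLES group*sex group*age / NOROW NOCOL NOPERCENT;
RUN;
```

NOROW, NOCOL and NOPERCENT only suppress the percentages so that the raw cell counts are easier to scan.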