Do I need to suppress the intercept in proc surveylogistic when using the strata statement to avoid the dummy variable trap (SAS 9.4)? I see that when I use the strata statement with proc logistic, the model automatically suppresses the intercept. When I use the strata statement with proc surveylogistic, the intercept is not suppressed, and I get very different coefficient estimates on other variables of interest. Why do proc logistic and proc surveylogistic handle the intercept differently under the Strata statement?
The STRATA statement in the two procedures have very different functionality attached to them. In Proc SURVEYLOGISTIC, it is used to identify the strata for a complex survey design. In Proc LOGISTIC, it is used to idenitfy the matched pairs for running models with n:m matching.
Put another way, the STRATA statement in LOGISTIC runs a stratified logistic regression while in SURVEYLOGISTIC it runs a logistic regression from a stratified sample.
Unless you had reason to restrict the intercept to zero in SURVEYLOGISTIC, you would not want to use the NOINT option.
The STRATA statement in the two procedures have very different functionality attached to them. In Proc SURVEYLOGISTIC, it is used to identify the strata for a complex survey design. In Proc LOGISTIC, it is used to idenitfy the matched pairs for running models with n:m matching.
Put another way, the STRATA statement in LOGISTIC runs a stratified logistic regression while in SURVEYLOGISTIC it runs a logistic regression from a stratified sample.
Unless you had reason to restrict the intercept to zero in SURVEYLOGISTIC, you would not want to use the NOINT option.
You might also consider looking at GENMOD with the REPEATED statement because GEE models do not require estimation of a parameter for each set of correlated observations (i.e. panels).
LOGISTIC with the STRATA statement uses a conditional model (or GENMOD with the STRATA and EXACT statements in GENMOD for exact, conditional estimation for other models--this approach obviously wouldn't work with the REPEATED statement)
Either way SURVEYLOGISTIC would not be what you would want to use.as can be done by using the STRATA statement in PROC LOGISTIC for logistic models
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.