I am running a series of logistic models for an analysis using 4 different survey sample data bases. Each has a stratified sample and sample weights (and psu/cluster information). I am running SAS 9.2 Maintenance Release 1. I ran a sample model using PROC LOGISTIC with a weight, PROC SURVEYLOGISTIC with a weight and stratum variable, PROC LOGISTIC unweighted, and then in desperation SAS-Callable SUDAAN 10 PROC RLOGIST (Logistic.) The R-Squares using PROC LOGISTIC with a weight are improbably high, and so are the R-Squares from SURVEYLOGISTIC with a weight and stratum. The R-Squares for unweighted LOGISTIC are more reasonable although the adjusted one is high. The R-Square for the SUDAAN logistic seem reasonable, a little higher than the unadjusted unweighted logistic. What do I believe here? Is there something weird about the SAS logistics and r-squares and weights??? Is there something I can specify to "fix" this? Any help appreciated!
The one contribution that I can add to your dilemma is that LOGISTIC is inappropriate. See this comment from the documentaiton:
"CautionROC LOGISTIC does not compute the proper variance estimators if you are analyzing survey data and specifying the sampling weights through the WEIGHT statement. The SURVEYLOGISTIC procedure is designed to perform the necessary, and correct, computations.."