Re: Why am I Getting Different Odds Ratios? Display relative risk in p...

krueg314 · Posted 04-28-2020 09:56 AM

proc surveylogistic data=data;
class TARGET b c;
model TARGET (event='1') = a b c d e f g / clparm;
strata STRATA;
cluster PSU;
weight WEIGHT;
run;

proc surveyfreq data=data;
strata STRATA;
cluster PSU;
weight WEIGHT;
tables (a b c d e f g)*TARGET / RelRisk clparm;
run;

I am interested in the odds ratio of variable b has on TARGET. I see different odds ratios for proc surveylogistic and proc survey freq, and manually proc surveyfreq makes sense when I take the weighted values. Why am I seeing different odds ratios and what can I do to fix? At least can I see relative risk in proc surveyfreq as that's the model I'm using.

ballardw · Posted 04-28-2020 10:41 AM

Without your data it is hard to tell exactly.

On possible cause is that Surveyfreq and Surveylogistic will treat missing values a bit differently. If any variable on the model statement is missing (unless the MISSING option is included on a the Class statement) then the entire record is not used for modeling (pretty common to most of the modeling procedures). Read the diagnostics about how many records are in the data set and how many actually used for the model.

krueg314 · Posted 04-28-2020 11:04 AM

Ok I think the key is to drop all missing values before running because the proc surveyfreq only accounts for the only missing values of b and TARGET, rather than the other ones droped by the regression.

krueg314 · Posted 04-28-2020 11:18 AM

I have tried this to delete all the missing before surveyfreq, but the odds ratio still a bit different 😕

data byemiss;
set data (KEEP=TARGET a b c d e f g STRATA PSU WEIGHT);
if nmiss(of _numeric_) > 0 then delete;
run;

StatDave · Posted 04-28-2020 11:14 AM

Beyond the issue of missing values, the results will still differ since the odds ratio estimates for any variable provided by SURVEYLOGISTIC are adjusted for the effects of the other variables in the model. The estimates from SURVEYFREQ are not adjusted for the other variables.

krueg314 · Posted 04-28-2020 11:21 AM

After accounting for missing this must be the reason. Any way I can still get the relative risk in the the proc surveylogistic statement, since I want to account for these interactions?

StatDave · Posted 04-28-2020 11:32 AM

Use the STORE, LSMEANS, and ODS OUTPUT statements in SURVEYLOGISTIC followed by the NLMeans macro as illustrated (using PROC LOGISTIC) in this note.

SteveDenham · Posted 04-28-2020 01:33 PM

I really, really, really wish that this macro had been around back when I first used PROC LOGISTIC and GENMOD. I was happily including ORs in stuff that went to study management, but they wanted everything expressed as relative risk, since that is what PROC FREQ generates and that is what they were used to.

SteveDenham

Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?

Re: Why am I Getting Different Odds Ratios? Display relative risk in proc surveylogistic?