Proc logistic vs proc genmod for continuous predictor in logistic regr...

mlensing · Posted 11-23-2020 12:38 PM

Hi everyone,

I am working on a group research project in which we are exploring the research question "Is greater length of residence in the US among immigrant mothers associated with increased risk of preterm delivery and low birth weight?" Our primary predictor is continuous (length of residence in months) while our outcomes are each dichotomous. We are using multivariable logistic regression to perform our analysis and are a little unsure of whether we should use proc logistic, proc genmod, or a different option to calculate relative risk.

Additionally, we are also having trouble defining our reference group. Ideally, we want to compare immigrant mothers to non-immigrant mother, however in our dataset, the values for length of residence are blank/missing for non-immigrant mothers. In this case, is it better to create a new dichotomous variable for immigrant status and use "no" as the reference group in the class statement (as done in the code below) or recode length of residence to set missing values (non-immigrant mothers) to an unrealistic month value (e.g. '999999') and use that as our reference group?

So far, we have tried using both proc genmod and proc logistic (code below):

proc genmod data = temp descending;

class imgrt (ref = 'N')/param = ref;

model preterm = LORMonths IMGRT/dist = poisson link = log;

run;

proc logistic data = temp descending;

class IMGRT (ref = 'N')/param = ref;

model preterm = LORMonths IMGRT;

run;

If anyone could explain which procedure is best in this situation to obtain relative risk and/or point us to understandable documentation, we would so appreciate it!

StatDave · Posted 11-23-2020 01:00 PM

You can't make that immigrant/nonimmigrant comparison if you have LOR in the model since you have no LOR data for nonimmigrants. That comparison would have to be made without LOR in the model. As for estimating the relative risk see this note which shows how you can do that using PROC LOGISTIC followed by the NLMeans macro.

PaigeMiller · Posted 11-23-2020 01:00 PM

I would think that both PROC GENMOD and PROC LOGISTIC ought to work, and give the same answers. I see that you have chosen DIST=POISSON for PROC GENMOD and that doesn't sound right, you want to use DIST=BINOMIAL.

Regarding the mothers who are immigrants and don't have a length of residence, I think you need to fit two different models, one for immigrants and one for non-immigrants. The model for immigrants would of course not include length of residence.

--
Paige Miller

Proc logistic vs proc genmod for continuous predictor in logistic regression

Re: Proc logistic vs proc genmod for continuous predictor in logistic regression

Re: Proc logistic vs proc genmod for continuous predictor in logistic regression