BookmarkSubscribeRSS Feed
mlensing
Obsidian | Level 7

Hi everyone, 

I am working on a group research project in which we are exploring the research question "Is greater length of residence in the US among immigrant mothers associated with increased risk of preterm delivery and low birth weight?" Our primary predictor is continuous (length of residence in months) while our outcomes are each dichotomous. We are using multivariable logistic regression to perform our analysis and are a little unsure of whether we should use proc logistic, proc genmod, or a different option to calculate relative risk.

Additionally, we are also having trouble defining our reference group. Ideally, we want to compare immigrant mothers to non-immigrant mother, however in our dataset, the values for length of residence are blank/missing for non-immigrant mothers. In this case, is it better to create a new dichotomous variable for immigrant status and use "no" as the reference group in the class statement (as done in the code below) or recode length of residence to set missing values (non-immigrant mothers) to an unrealistic month value (e.g. '999999') and use that as our reference group?

 

So far, we have tried using both proc genmod and proc logistic (code below):

 

proc genmod data = temp descending;

class imgrt (ref = 'N')/param = ref;

model preterm = LORMonths IMGRT/dist = poisson link = log;

run;

 

proc logistic data = temp descending;

class IMGRT (ref = 'N')/param = ref;

model preterm = LORMonths IMGRT;

run;

 

If anyone could explain which procedure is best in this situation to obtain relative risk and/or point us to understandable documentation, we would so appreciate it!

 

2 REPLIES 2
StatDave
SAS Super FREQ

You can't make that immigrant/nonimmigrant comparison if you have LOR in the model since you have no LOR data for nonimmigrants. That comparison would have to be made without LOR in the model. As for estimating the relative risk see this note which shows how you can do that using PROC LOGISTIC followed by the NLMeans macro.

PaigeMiller
Diamond | Level 26

I would think that both PROC GENMOD and PROC LOGISTIC ought to work, and give the same answers. I see that you have chosen DIST=POISSON for PROC GENMOD and that doesn't sound right, you want to use DIST=BINOMIAL.


Regarding the mothers who are immigrants and don't have a length of residence, I think you need to fit two different models, one for immigrants and one for non-immigrants. The model for immigrants would of course not include length of residence.

--
Paige Miller

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 451 views
  • 2 likes
  • 3 in conversation