BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
Kastchei
Pyrite | Level 9

Hello, everyone!

 

I have a model that incorporates race as (Whites vs other) and ethnicity as (Hispanic vs not).  The PI wants to look at non-Hispanic Whites vs all others as a contrast.  This is easily enough done by including the interaction in the model and writing:

lSMestimate race2*hispanic2 'All Others vs. Non-Hispanic White'  1 -3
                                                                 1  1 / divisor = 3;

It turns out that this interaction is very not significant (Type 3 p = 0.70).  I realize that the interaction staying in the model while not significant is not likely to affect my results much.  But if I did want to remove the interaction from the model and rerun, could I still do the same comparison: non-Hispanic Whites vs all others?  I've written this code, and it's estimable, but my hunch is that theoretically, the model can no longer say anything about a combination of the two groups, even though I technically can form the estimates and compare their difference.

*estimate 'Hispanic Non-White'            intercept 1 race2  0  1 hispanic2  1  0;
*estimate 'Non-Hispanic Non-White'        intercept 1 race2  0  1 hispanic2  0  1;
*estimate 'Hispanic White'                intercept 1 race2  1  0 hispanic2  1  0;
estimate 'Avg. of All Others' intercept 3 race2 1 2 hispanic2 2 1 (divisor = 3), 'Non-Hispanic White' intercept 1 race2 1 0 hispanic2 0 1, 'Avg. of All Others vs. Non-Hispanic White' race2 -2 2 hispanic2 2 -2 (divisor = 3);

This does give an answer in the same ballpark as the model with the interaction, which might be expected since the interaction was not significant.

 

My reason for asking is because my real model has 8 racial groups and 3 ethnicity groups, and running the model with an interaction is not possible (quasi-complete separation in logistic).  I dichotomized race and ethnicity to get an answer to the study question, but I am just curious if I could have kept the full responses, omitted the interaction, and still answer the question.  The answer I get when I do that is very different from either of the dichotomized answers above.  I want to understand if that difference is likely because what I did is invalid or if keeping all the detail about the other racial and ethnic categories might really have changed the result a lot.

 

Warm regards,

Michael

 

 

1 ACCEPTED SOLUTION

Accepted Solutions
StatDave
SAS Super FREQ
I don't see anything wrong with your approach, but keep in mind that if you make that comparison in the model that omits the interaction, the effect of White is assumed by the model to be totally unaffected by ethnicity even if there is some small, but insignificant effect of ethnicity. BTW, you might find it more intuitive to create a single variable that has a unique value for each combination of race and ethnicity and use that in place of race, ethnicity and their interaction. That would allow you to use the LSMESTIMATE statement to write the comparison(s) of interest.

View solution in original post

2 REPLIES 2
StatDave
SAS Super FREQ
I don't see anything wrong with your approach, but keep in mind that if you make that comparison in the model that omits the interaction, the effect of White is assumed by the model to be totally unaffected by ethnicity even if there is some small, but insignificant effect of ethnicity. BTW, you might find it more intuitive to create a single variable that has a unique value for each combination of race and ethnicity and use that in place of race, ethnicity and their interaction. That would allow you to use the LSMESTIMATE statement to write the comparison(s) of interest.
Kastchei
Pyrite | Level 9

Thank you!  And duh!  I don't know why I just didn't make a combined variable.  It's so obvious now, haha.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 385 views
  • 1 like
  • 2 in conversation