DanBos
Calcite | Level 5

Hi all,

I have a large dataset (19k) and I am using logistic regression to estimate patient-level probabilities of experiencing an event. I am interested in the effect of a facility characteristic on the odds of that event. The facility characteristic is naturally continuous, and I have categorized it into quartiles and calculated the crude event rate within each quartile.

When I request predicted probabilities from a logistic regression model that contains only patient and disease characteristics, the mean predicted probabilities across the facility-characteristic quartiles are very similar to each other and to the overall crude rate (within a thousandth).

I am struggling to understand the following: when I add the facility-characteristic quartile to that model and request the predicted probabilities, the mean predicted probability within each quartile equals the crude rate for that quartile. I understand that this would happen if quartile were the only predictor in the model, but it is not the only predictor here. Can anyone help me understand why this is? Happy to provide coefficients and output.

 

With much appreciation!

1 REPLY 1
Rick_SAS
SAS Super FREQ

I think your question will be clearer if you post the SAS code that you are using. It is difficult to follow your question. My best guess is that the coefficient of the "facility characteristic" is relatively small, so that the predictions of the model WITHOUT the "facility characteristic" are essentially the same as the predictions WITH the "facility characteristic."
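A general property of maximum likelihood may also be relevant to the exact agreement described in the question. The sketch below assumes the quartile enters the model as a categorical (CLASS) effect, the model has an intercept, and it is fit by ordinary unweighted maximum likelihood; the symbols $Q_k$, $n_k$, and $\hat{p}_i$ are notation introduced here, not taken from the original post.

```latex
% Log-likelihood with linear predictor \eta_i = x_i^\top \beta
\ell(\beta) = \sum_{i=1}^{n} \left[ y_i \eta_i - \log\left(1 + e^{\eta_i}\right) \right]

% Score equation for each coefficient \beta_j at the ML estimate
\frac{\partial \ell}{\partial \beta_j}
  = \sum_{i=1}^{n} x_{ij}\,\bigl(y_i - \hat{p}_i\bigr) = 0

% If x_{ij} is the indicator of quartile Q_k (with n_k patients), this becomes
\sum_{i \in Q_k} \bigl(y_i - \hat{p}_i\bigr) = 0
\quad\Longrightarrow\quad
\frac{1}{n_k} \sum_{i \in Q_k} \hat{p}_i = \frac{1}{n_k} \sum_{i \in Q_k} y_i
```

Under these assumptions, the mean predicted probability in each quartile matches that quartile's crude rate no matter which other covariates are in the model. The reference quartile is covered as well: the intercept's score equation forces the residuals to sum to zero overall, and subtracting the equations for the other three quartiles leaves the reference quartile. When quartile is not in the model, only the overall mean is constrained this way, so the quartile-level means are free to differ from the crude rates.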

 

In general, I find that interpretation is improved by graphing the probabilities. The easiest way to do that is to use the EFFECTPLOT statement. I've written an article about how to use the EFFECTPLOT statement to visualize predicted values of regression models, and the example is a logistic model, so you should be able to adapt the code in the blog post to your case.
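As a rough illustration of that suggestion, here is a minimal sketch that uses simulated data. It is not the code from the blog post or from the original poster. The data set WORK.PATIENTS and the variables fac_quartile, age, severity, event, and phat are all hypothetical stand-ins, and the simulation coefficients are arbitrary.

```sas
/* Simulated stand-in data: every name and coefficient here is hypothetical */
data work.patients;
   call streaminit(2024);
   do id = 1 to 5000;
      fac_quartile = ceil(4 * rand('uniform'));   /* facility quartile, 1 to 4 */
      age          = 50 + 10 * rand('normal');    /* patient covariate         */
      severity     = rand('normal');              /* disease covariate         */
      eta   = -2 + 0.02*(age - 50) + 0.8*severity + 0.15*fac_quartile;
      event = rand('bernoulli', 1 / (1 + exp(-eta)));
      output;
   end;
   drop eta;
run;

ods graphics on;

/* Logistic model with the quartile as a CLASS effect */
proc logistic data=work.patients;
   class fac_quartile (ref=first) / param=ref;
   model event(event='1') = age severity fac_quartile;
   /* Predicted probability versus age, one curve per quartile;
      other continuous covariates are held at their means by default */
   effectplot slicefit(x=age sliceby=fac_quartile) / noobs;
   /* Save the patient-level predicted probabilities */
   output out=work.pred p=phat;
run;

/* Compare the crude rate (mean of event) with the mean predicted
   probability (mean of phat) within each quartile */
proc means data=work.pred mean maxdec=4;
   class fac_quartile;
   var event phat;
run;
```

Removing fac_quartile from the CLASS and MODEL statements and rerunning the PROC MEANS step gives the comparison for the model without the facility characteristic.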

 
