Turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

- Home
- /
- Programming
- /
- SAS Procedures
- /
- Re: adjusting for a covariate, maybe clustering?

Options

- RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page

🔒 This topic is **solved** and **locked**.
Need further help from the community? Please
sign in and ask a **new** question.

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Posted 04-15-2020 11:27 PM
(1117 views)

Hi,

I have tried to study association between outcome-medicine prescribed to patients (it is dose of medicine in continuous number) and some variables like age, ethnicity, smoking status, insurance status and weekday medicine was prescribed. Main reason was to see if dose of medicine prescribed is associated with week day it is prescribed.Something to see like higher doses are prescribed over weekend.

I used linear regression model. Reviewers are saying that this association might be due to specific surgeons operating on specific days of week. There are 90 surgeons , whose patients are in the dataset.

Please let me know how should i handle this question.

Thank you very much in advance!

1 ACCEPTED SOLUTION

Accepted Solutions

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

I would add surgeon into the model, in a perfect world this would account for the effect of surgeon on the prescribed amount of a drug. The problem in the imperfect real worls is that if you have 90 surgeons, you need a lot of data in order to find real effects (and maybe you have a lot of data). But again, in a perfect world, then the effect of weekend *vs*. weekday has the effect of surgeon removed.

I don't see clustering as a possible solution here.

--

Paige Miller

Paige Miller

26 REPLIES 26

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

What is the N? Can you add surgeon as a factor in the model? Or specifically add indicators for surgeons who do weekend/night versus day shifts or however your shifts are assigned.

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Well, the first thing I would do is look at the distribution of surgeons by day of the week and identify those that are "weekend" surgeons and those that are "weekday" surgeons which is what @Reeza is suggesting. Then I would add this into the mix of variables that you have in your model. I presume that because some of your variables are categorical you are using something like GLM or MIXED to do your linear regression. I make this assumption because a regression that considers ethnicity as continuous variable is probably going to give different results depending on the values assigned. Is this a correct assumption?

SteveDenham

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

I would add surgeon into the model, in a perfect world this would account for the effect of surgeon on the prescribed amount of a drug. The problem in the imperfect real worls is that if you have 90 surgeons, you need a lot of data in order to find real effects (and maybe you have a lot of data). But again, in a perfect world, then the effect of weekend *vs*. weekday has the effect of surgeon removed.

I don't see clustering as a possible solution here.

--

Paige Miller

Paige Miller

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Hi, I wanted to add one more thing to this.

We have submitted another paper using the same dataset in a different journal. Here we were looking for association between outcome-prescription and different variables patient characteristics including age , gender, ethnicity , history of alcohol abuse etc. , provider characteristic including provider's age, gender , year of practice.

We did mutivariate linear regression Proc GLM and found that patient age, gender, ethnicity, surgeon gender, age, and years in practice was significantly predictive of the amount of opioids prescribed.

We have received a comment from reviewer saying, '*My major concern in the paper is the way that surgeon characteristics are incorporated in the models. There is likely significant clustering by surgeon and thus inclusion of just the characteristics without adjusting for clustering may incorrectly measure the effect of the surgeon predictors on the outcome. A multi-level model (with patient and surgeon-levels) would be more appropriate and account for clustering effects.* '

Plaese let me know what do you think about this. N= 23000 and number of surgeons= 90

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

There is likely significant clustering by surgeon

II don't really know what this means, and I have no background in medical studies, nor do I know if this statement is true or if it is reflected in your data.

So I think you'd have to look into the data and see. Or talk to people in your field who might be able to help. Or both.

--

Paige Miller

Paige Miller

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Thank you very much for the prompt reply! This community is always a great asset.

Just one help- Can you please direct me to SAS codes which i can use if clustering is a problem and outcome is continuous. (currently using Proc GLM)Thanks!

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

You don't have time of day for the surgery?

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Thanks for pointing this out to me. I have date of surgery and time of surgery.

I can get weekday of surgery and day, evening, night shift of surgery from there.

I can add day of surgery and shift of surgery to the model.

Sorry, to give too much trouble but how do i explain to reviewer that it cannot be explained by clustering . I think they would like an explanation. Thank you!

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

@Kyra wrote:

Try putting the surgeon's age, gender and years of practice into the model. This could adjust for whatever was meant by "clustering" of surgeons.

--

Paige Miller

Paige Miller

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

Surgeons age, gender, year of practice is already in the model

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

@Kyra wrote:

Surgeons age, gender, year of practice is already in the model

Well, if you are using all the information you have about the surgeon, its hard to see how any type of "clustering" (no matter how it is defined) can improve on that.

--

Paige Miller

Paige Miller

- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content

@Kyra wrote:

Thank you very much for the prompt reply! This community is always a great asset.

Just one help- Can you please direct me to SAS codes which i can use if clustering is a problem and outcome is continuous. (currently using Proc GLM)Thanks!

No, I can't because I can't see how clustering would help here.

However, in thinking about the problem, and do you need to modify the model to take account of something that can be loosely called "clustering" of surgeons ... this is what I thought ... look at the residuals. If there are patterns in the residuals — specifically when you plot the residuals against surgeons, but really you should plot residuals against all variables — this indicates a deficiency of the model (or an incorrect assumption somewhere) and it could be that whatever is going on with the surgeons is causing the pattern (although other things cause patterns). If there is no pattern in the residuals, then you probably don't need to worry about whatever is meant by "clustering" of surgeons.

--

Paige Miller

Paige Miller

**Don't miss out on SAS Innovate - Register now for the FREE Livestream!**

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.