BookmarkSubscribeRSS Feed
wgao
Calcite | Level 5

I am working on models to predict hospital readmission. The data were arranged by admissions (claim ID) with a flag to indicate whether there was readmission followed. A member (person) could have multiple admissions, which meant one member ID could correspond to multiple claim IDs. How do I take this repeated measure into account in SAS EM? I did not find any place I could define repeated variable in regression, decision tree or neural network. I suspect probably variable role in SAS EM could clarify this? But how? When I raised this question in one SAS user group meeting, people from bank system told me that my scenario is equivalent to theirs in which they have person level data as well as account level data (one person could have multiple accounts). But they could not answer my question.

Does anyone have any experience? Thanks.

3 REPLIES 3
rayIII
SAS Employee

Hi, wgao.

I'm not an expert on recurring events, but I'd approach the problem like this:

Suppose each individual patient record were coded 1,2,...n based  on the 'episode'. Examine whether the factors affecting probability of readmission are the same across episodes.

You can investigate this by building stratified models of readmission for each episode (you might need to combine the highest numbers of episodes since you're likely to see relatively few patients with really large numbers of episodes). Some basic visual exploration may help as well.

If there isn't a systematic relationship between episode and the factors affecting readmission, then you can simply ignore the episodes, and build a single model treating all patient records equally. If your ability to predict readmission does depend on the episode, then you could include it as an model input or go with stratified models. (I'd try try both and see which gives the best predictions in validation.)

Hope this helps,

Ray

JBerry
Quartz | Level 8

Outside of E-miner, PHREG would handle this (proc logistic has STRATA too, depending on your version). I have not done this inside E-miner, though.

Perhaps you can think about your problem like a Cox regression and model time-until-readmission. E-miner does this very well using the Survival node (under the Applications toolbar) which I believe uses PHREG behind the scenes.

Josh

Funda_SAS
SAS Employee

Hi Wgao,

Repeated measures analysis is not available in EM. I would look into PROC MIXED and PROC GLIMMIX which would enable you to specify specific correlation structures for the observations that come from the same person.

Note that if you have a very large data set with large number of predictors, fitting mixed models can be computationally very exhaustive. If that is the case, try HPMIXED which might provide batter performance.

Thanks.

Funda

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 1538 views
  • 0 likes
  • 4 in conversation