BookmarkSubscribeRSS Feed
Therain
Calcite | Level 5

Hello,

 

I use SAS 9.4 and I am looking at how the presence or absence of disease at baseline year would affect the number of health care use in the next 2 years. I have a data set like this:

id     baselinedisease     healthuse

1      0                              3

2      0                             15

3      1                              2

4      0                              5

5      1                              0

...

However, some people developed the disease at year 1 post baseline or year 2 post baseline, and I think this would affect their subsequent number of health care use. So I want to reflect this in the model. I have another data set showing me the presence or absence of disease for each participant in year 1 and year 2 post baseline. The current codes I am using is:

 

proc genmod data=have;
class disease (ref='0') / param=glm;
model healthuse = disease / type3 dist=poisson;
run;

Does anyone can help me figure out the problem? Thank you in advance!

5 REPLIES 5
sld
Rhodochrosite | Level 12 sld
Rhodochrosite | Level 12

Would it make sense to create a factor (Occurence) with three levels?

Occurence=0 if no disease at t=0, t=1, or t=2

Occurence=1 if no disease at t=0, disease at t=1 and t=2

Occurence=2 if no disease at t=0 or t=1, disease at t=2

 

Each id would belong to one and only one level. 

Therain
Calcite | Level 5
Hello, thank you for the reply! I do not know if my understanding is correct, but I am thinking the new disease can only happen in those without disease at baseline. So if we create another factor and include it in the model, it may not be necessary because this factor cannot be the confounder (i.e. No new developed disease in those with disease at baseline)?
sld
Rhodochrosite | Level 12 sld
Rhodochrosite | Level 12

I was thinking of replacing baselinedisease (with two levels) with Occurence (with 3 levels):

 

model healthuse = Occurence / type3 dist=poisson;

Whether this makes sense might depend on which two years you counted healthuse (e.g., the two years following the baseline assessment?), as well as other considerations.

Therain
Calcite | Level 5

Hello, Thank you for the clarification and sorry for the late reply. I think if we replace it with this, the research question would become comparing incident and non-incident disease groups, instead of comparing having the disease or not. Now I am considering using something like the following codes:

 

proc genmod data = have;
class study_id disease;
model healthuse = disease / dist=poisson link=log type3;
repeated subject = study_id / type=ar(1) corrw;
lsmeans disease/diff cl;
run;

 

There is also another file seems related to this issue: https://methodology.psu.edu/downloads/tvem

 

I will keep working on this and to see if this works.

sld
Rhodochrosite | Level 12 sld
Rhodochrosite | Level 12

You could write a contrast for the levels of Occurrence that compares level 0 (non-incident disease group) to the mean of levels 1 and 2 (incident disease groups).

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 1710 views
  • 2 likes
  • 2 in conversation