BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
dinyae1
Calcite | Level 5

A series of incidence and population data is given. How can the incidence change, CI and p-value for the change be determined for the time series:

YearMaleMale population
2012193809086
2013203813246
2014213812580
2015253811591
2016123807555
2017143795242
2018163790916
2019123797092
202073803236
202143787284

 

Thanks for the help if you provide the running code.

1 ACCEPTED SOLUTION

Accepted Solutions
Quentin
Super User

How well does the model fit your data?  You didn't show any fit statistics, but it looks like you've modeled it as a straight line that doesn't fit very well.  The estimate for the year parameter is not significant. Maybe explore some other models.  The raw incident rates are:

 

Year Male Male population Incidence
2012 19 3809086 0.000005
2013 20 3813246 0.000005
2014 21 3812580 0.000006
2015 25 3811591 0.000007
2016 12 3807555 0.000003
2017 14 3795242 0.000004
2018 16 3790916 0.000004
2019 12 3797092 0.000003
2020 7 3803236 0.000002
2021 4 3787284 0.000001

 

That doesn't look linear to me.  Maybe try adding a quadric term (year**2) and see if that helps.

View solution in original post

13 REPLIES 13
ballardw
Super User

P-value implies some sort of statistical test is run. Which test do you want a p-value for?

 

What is your "incidence rate change" measured in? Percent of change, per some standardized value such as per 100,000 individuals?

dinyae1
Calcite | Level 5

Hi. Thanks. Figures absolute numbers of melanoma (per 1000 000 individuals). I did Poisson model for mean annual change. 

Now the % incidence change for the entire time series should be +CI and the p-value for the change.

Thanks for the help again.

Alex

PaigeMiller
Diamond | Level 26

@dinyae1 wrote:

A series of incidence and population data is given. How can the incidence change, CI and p-value for the change be determined for the time series:


Do you want the combined incidence rate over all of the years? Is that the sum of all of the values of MALE divided by the sum of all of the values of MALE POPULATION? Or is the statistic you are thinking about different? If so what are the calculations? Exactly how should we handle the different years shown?

--
Paige Miller
dinyae1
Calcite | Level 5

Hi. I did Poisson model for mean annual change. 

Now the % incidence change for the entire time series should be +CI and the p-value for the change. What is correct SAS code for this problem?

Thanks for the help again.

Alex

PaigeMiller
Diamond | Level 26

@dinyae1 wrote:

Hi. I did Poisson model for mean annual change. 

Now the % incidence change for the entire time series should be +CI and the p-value for the change. What is correct SAS code for this problem?


I don't think this answers any of my questions. EDIT: I see @Quentin has similar questions and it would be very helpful for you to explain what you want to do with this data, show examples of the calculation, etc., as I asked for.

 

The only new information you have provided is that you "did Poisson model". Okay, tell us (or better yet show us the code) how you did the Poisson model. Also please show us the output.

--
Paige Miller
dinyae1
Calcite | Level 5

proc genmod data=insure;

model male = year / dist=poisson link=log offset=ln;
  effectplot / clm moff;
run;

dinyae1_1-1703089684374.png

dinyae1_2-1703089728026.png

 

 

 

ballardw
Super User

Uses a variable LN not defined in any example data so a bit hard to discuss accurately:

 


@dinyae1 wrote:

proc genmod data=insure;

model male = year / dist=poisson link=log offset=ln;
  effectplot / clm moff;
run;

dinyae1_1-1703089684374.png

dinyae1_2-1703089728026.png

 

 

 


 

Quentin
Super User

I'm assuming this is a homework problem, since I can't believe melanoma rates would decrease that quickly.  It's fine to ask for help with homework questions, but it's helpful to describe them as such.

 

It's not clear what you are trying to estimate.  Let's assume for a moment you can fit a perfect model, which will allow you to predict the number of male cases in any year.  What is the incidence change you want to estimate?  Is it 2021 incidence - 2012 incidence? Or is it some sort of average annual change (would seem weird for this data)?

 

If you have a model that fits this data reasonably well, so incidence=f(year), and you have an estimate of a parameter for year, with a standard error, that would allow you to predict incidence in any year, and calculate a confidence interval for that prediction.  And you could subtract the predictions for any two years to get a predicted change between those two years...

dinyae1
Calcite | Level 5

Thank you for your comments. This is a very rare melanoma. I want to estimate the incidence % change for the entire 2012-2021 period.

Quentin
Super User

@dinyae1 wrote:

Thank you for your comments. This is a very rare melanoma. I want to estimate the incidence % change for the entire 2012-2021 period.


When you say " incidence % change for the entire 2012-2021 period", do you mean change from 2012 incidence (19/3809086) to 2021 incidence  (4/3787284)?

What would be the point estimate for the change, just (4/3787284 - 19/3809086) / (19/3809086) = -79% ?

dinyae1
Calcite | Level 5

Rather, I would like to ask if, apart from the Poisson model, is there any other suitable model for examining changes in incidence, time series analysis? Or does the Poisson model give a convincing enough result on its own?

Quentin
Super User

How well does the model fit your data?  You didn't show any fit statistics, but it looks like you've modeled it as a straight line that doesn't fit very well.  The estimate for the year parameter is not significant. Maybe explore some other models.  The raw incident rates are:

 

Year Male Male population Incidence
2012 19 3809086 0.000005
2013 20 3813246 0.000005
2014 21 3812580 0.000006
2015 25 3811591 0.000007
2016 12 3807555 0.000003
2017 14 3795242 0.000004
2018 16 3790916 0.000004
2019 12 3797092 0.000003
2020 7 3803236 0.000002
2021 4 3787284 0.000001

 

That doesn't look linear to me.  Maybe try adding a quadric term (year**2) and see if that helps.

dinyae1
Calcite | Level 5

Thanks for the suggestion, it helped. Homework is done. Merry Christmas.

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 13 replies
  • 5108 views
  • 0 likes
  • 4 in conversation