TomTom
Calcite | Level 5

Hello,

I have the following problem:

I fitted a model using proc genmod. I stored this model in an item store using the STORE statement, and I now intend to use it to score a different dataset with proc plm. However, I would like to modify the stored model before using it for scoring. Is it possible to do this? My modification is something like (a bit more complicated actually): add 1 to each parameter. Naively, I tried to access the stored model in a data step, which doesn't seem to be possible.

Many thanks for your help

5 REPLIES
SteveDenham
Jade | Level 19

Add 1 to each parameter?  That seems, well, unusual.  I could understand adding 1 to each independent value (say if you had a log link), but adding a constant to the estimates doesn't seem like a very good idea, let alone doing something more complex.  The values would no longer be the MLEs, and hence scoring would be biased.  Could you give the motivating reasons for the process?  And maybe what the modification you want to apply will actually be.

Steve Denham

TomTom
Calcite | Level 5

Well, what I intend to do is actually more like adding 1 to each variable. However, I thought it would be neat to do this via the stored model, and this got me generally interested in whether I can modify these item-store objects.

I essentially want to do what Joyce et al. did in this paper: http://www.ncbi.nlm.nih.gov/pubmed/12407470

They calculate adjusted mortality rates. Their correction subtracts the observed value of a confounder from the mean value of this confounder in the dataset, multiplies the difference by the regression coefficient b, and adds this term to the observed outcome (mortality rate). So for one independent variable the adjusted rate is given by

y_adj = y_obs + b (x_mean - x_obs)

SteveDenham
Jade | Level 19

OK.

I think you will have to use a DATA step solution.  The real key here is what you use for x_mean--the mean of the x values from the original dataset (where the coefficients were derived), the mean of the x values from the new data you want to score, or the mean of the x values for the combined datasets.  If we believe that the coefficients are unbiased estimators of the population parameters, then we probably want to use the mean of the x values from the new data.  It becomes a matter of outputting the coefficients to one data set, calculating x_mean (new) and saving it to a dataset, merging the mean back against the original data, calculating the difference, multiplying by the coefficient and adding to the new y value.  Scale this up across all of the x's.
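
For anyone following along, the steps described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not tested against the original data: the dataset names (train, newdata), the variables (y, x1, x2), and the distribution and link are all hypothetical, the covariates are assumed to be continuous (no CLASS variables), and x_mean is taken from the new data. With a non-identity link you would also need to decide on which scale the additive correction makes sense.

```sas
/* Hypothetical names: train, newdata, y, x1, x2.                    */
/* Step 1: fit the model and write the coefficients to a data set.   */
proc genmod data=train;
  model y = x1 x2 / dist=normal link=identity;  /* assumed dist/link */
  ods output ParameterEstimates=pe;
run;

/* Step 2: reshape the estimates into one row with columns b_x1, b_x2. */
/* The Parameter values are assumed to match the variable names.       */
proc transpose data=pe out=b(drop=_name_) prefix=b_;
  where Parameter in ('x1', 'x2');
  id Parameter;
  var Estimate;
run;

/* Step 3: means of the covariates in the new data (x_mean). */
proc means data=newdata noprint;
  var x1 x2;
  output out=xm(drop=_type_ _freq_) mean=m_x1 m_x2;
run;

/* Step 4: attach the one-row coefficient and mean data sets to every */
/* observation and compute y_adj = y_obs + sum of b*(x_mean - x_obs). */
data adjusted;
  if _n_ = 1 then do;
    set b;
    set xm;
  end;
  set newdata;
  y_adj = y + b_x1*(m_x1 - x1) + b_x2*(m_x2 - x2);
run;
```

The `if _n_ = 1 then set` idiom reads each one-row data set once and retains its values for all observations, which avoids a sort-and-merge. With many covariates, arrays over the b_ and m_ variables would be tidier than writing out each term.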

Steve Denham


TomTom
Calcite | Level 5

Hi Steve

Which values to use for the calculations of x_mean was also giving me a bit of a headache. Thanks for your thoughts, very useful.

Thanks also for the DATA step solution to my problem. It's not that easy, I think, which is why I wondered whether I could fiddle with a model in the item store. Anyway, it looks like nobody has an answer to my question (or the experts are too busy), so I'll have a try at your DATA step solution.
