About JacobSimonsen

JacobSimonsen · ‎04-25-2015

No, its opposite. Continous variables should not be included in the class statement. Categorical variables should be included in the class statement.

JacobSimonsen · ‎04-24-2015

When the wald test (or log-rank or -log(LR)) are significant it can be due to only one big contrast between two Groups, and not neccesaryly that all four Groups differ from each other. It is correct as you say that four Groups can not be assumed to be equal. Proc phreg produce Wald, LogRank and -2Log(LR), they are asymptotic equivalent. You can write "type3(wald)" as option in the modelstatement if you only want the wald test.

JacobSimonsen · ‎04-24-2015

The p-value of 0.0049 test the hypothesis that all three parameters are zero. DF=3 because there are three less in a model where all the four Groups are equal (four Groups because there is also the reference Group).

JacobSimonsen · ‎04-10-2015

It is not always meaningfull to use the disperson index to select a model. If you have time-to-event data, and summarize these data with number of events and number of personyears on each combination of covariates, then you can analyze the data with Poisson-regression. But, that is just a trick to make inference about parameters. In such case you can not use anykind of statistics that make use of the poisson-distribution, therefore you can also not use the dispersonindex and analyze the data with negative binomial instead. Even if you generate your own time-to-event data with piecewise-constant hazard-rates, and analyze with Poisson regegression you can observe a dispersion index far from 1. Even though that all assumptions for Poisson regression was fulfilled. The reason why the p-values can change so dramatically is that in the negative binomial distribution it involves a variance parameter. If your data is aggregated on one more covaraite (even that this covariate is not included in the model!) then the variance parameter will be much smaller, which in turns will make confidenceintervals more narrow and p-values smaller. I can illustrate the princip in this simple example: Let say you just have one covariate "a". then data mydata; input a count personyears; logpop=log(personyears); cards; 0 10 10 1 20 40 ; run; proc genmod data=mydata; class a; model count=a/dist=nb link=log offset=logpop; estimate 'a' a 1 -1; run; Now, same data, but in addition a covariate "b" is also observed and data is aggregated on "b" as well; data mydata; input a b count personyears; logpop=log(personyears); cards; 0 0 6 5 0 1 4 5 1 0 10 20 1 1 10 20 ; run; proc genmod data=mydata; class a; model count=a/dist=nb link=log offset=logpop; estimate 'a' a 1 -1; run; *with the negative binomial model you will get other p-values, but same mean estimates. *with Poisson you will get same results with the two models.;

JacobSimonsen · ‎04-09-2015

Hi PaigeMiller, If I remember right, if you consider your data to be normal distributed, then the maximum likelihood will also be the estimate obtained by ordinary least squares. Therefore I think also that nlmixed can solve the problem. Good luck. Jacob

JacobSimonsen · ‎04-09-2015

I agree with that it may not be wise to make such a bound on the parameters. Also, in case some of estimates happens to be on the bound the p-values will not be valid. Nevertheless, if you still insist to do this, it is easy to put restrictions on the parameters in nlmixed, which can handle most regression models. I found this example in the SAS-documentation where they use a bound on the parameters: http://support.sas.com/documentation/cdl/en/statug/67523/HTML/default/viewer.htm#statug_nlmixed_examples03.htm Jacob

JacobSimonsen · ‎04-09-2015

Hi Wernie, First of all, I think it is wrong to use the negative binomial distribution to model number of events in a population study. The reason why you can use Poisson distribution with "logpop" as offset is that the likelihood function will be similar to what you get if the number of events had been Poisson-distributed. You can make inference similar to what you would do if events were Poisson-distributed, but you can not make model-assessment were you use the distribution-assumption. Therefore, it is not meaningfull to say that data is overdispersed because it is a conclusion that use the distribution-assumption, and I guess it is of that reason you use the negative-binomial distribution. It should not matter whether you include a factor in the class statement which is not used in the model statement. That will give same result. Though, if there are missing values in that variable, these observations will be deleted when the variable is included in the class statement. I will recommend not to include variables in class that are not used in the model. It will matter a lot for the p-values in a negative-binomial model whether or not you aggregate on a variable before you run the analysis even though it is not included it neither the model-statement or the class statement. This is not the case for the poisson-model. Jacob

JacobSimonsen · ‎04-01-2015

It is correct that only timeintervals where an event happens is included in the analysis. That should be understood in the way, that if some person have an event at some time, then other persons interval at that time do matter, because they were at-risk at that time. The programming steps are applied to all records where an event-times occur. That means, the programming steps are applied multiple times to each record, and a number of times proportional to N^2, (or rather N x number-of-events). Event-times are where events occurs. Intervals-endpoints can be either censored or non-censored (equivalent to non-event and event). Typically, all intervals except the last one will be non-events (censored). About the weighted average, I agree on your statement. I wrote it wrong, you did it right!

JacobSimonsen · ‎04-01-2015

I was thinking that the weight used in the model is the weight that a person have it the weight goes linear. That is, a weighted average of the measured weights at the endpoint of an interval, with most weight to the nearest endpoint. if you in the table above add two variables t1 and t2 which is the same as entry and exit (you need to copy them, because the exit variale is used for the running time in the Cox regression). also add "weight1" as the weight measured at the left endpoint and "weight2" the weight at the right endpoint. Further, you need the event-variable that should be 0 at all those intervals that where the person is censored at the right endpoint. It will only take the value 1 at the last interval, and that only in case the person has an event at all. Something like this should Work: proc phreg data=mydata; weight=((exit-t1)*weight1+(t2-exit)*weight2)/(t2-t1); model (entry exit)*event(0)=weight; run; or if you will dichotomise: proc phreg data=mydata; weight=( ( (exit-t1)*weight1+(t2-exit)*weight2)/(t2-t1)>55); model (entry exit)*event(0)=weight; run;

JacobSimonsen · ‎04-01-2015

Hi Kastchei, The weight observed at the first timepoint is used until the second timepoint. That is more easy to see when you have made a table where the exit point is added: entry exit weight category 0 56 55.52477493 > 55 kg 56 140 54.88467677 <= 55 kg 140 229 54.65788059 <= 55 kg 229 359 56.24545388 > 55 kg 359 372 51.48273400 <= 55 kg In your example, if the last time, which is either an event-time or censoring time is observed at 372, then the weight at that time is not used. Alternative, If your are very ambitious, you can smooth out the weights between the observed timepoints, but it well require the assumption that weight is not affected by eventtimes, because otherwise you will conditioning on future events.

JacobSimonsen · ‎03-25-2015

hello, yes, it is possible to get the cumulative incidence functions in a competing risk model. The Fine & Gray method gives you want you want, and it is implemented in the most recent release. It is quite easy. Your censoring variable should also indicate what type of event that occur, and the eventcode option in the model statement is used for specifying the type of event of interest. The modelline should be something like this, assuming that eventtype "1" is the event of interest. model T*Status(0)= X1-X5 / eventcode=1; The status variable here can take other values than 0 (censoring) and 1(event of interest). And with the baseline statement you get the probability funciton out in a dataset with the CIF= keyword.

JacobSimonsen · ‎03-24-2015

Hello Than, I have two suggestions: 1) I think the only reason for why it sometimes Works and sometimes not is the random samples in the assessment. If you dont need to assess the functional form, then you shold omit the assessment statement, and the error will most likely disappear. 2) Also, since preedin is not timedependent, you should calculate logpreedin in a dataset before you run PHREG. Then, phreg will see logpreedin as any other fixed covariates, and the log-transformation will not be used in the assessment neithter (becase logpreedin is then already a transformed variable). Therefore, defining the logpredin in a dataset should also remove the error. Good luck. Jacob

JacobSimonsen · ‎02-22-2015

I agree that the EXP-option will not work with the logit-link. That was not what I meant. The estimate-statement automatically produce the correct estimate on probability scale. Below is the output when the log-link is used. The "Mean Estimate" gives the probability estimate, and also confidence intervals are produced (in bold). Almost same output if logit is used as link. estimate 'intercept' intercept 1; Mean Mean L'Beta Standard L'Beta Chi- Label Estimate Confidence Limits Estimate Error Alpha Confidence Limits Square Pr > ChiSq intercept 0.4561 0.4027 0.5165 -0.7851 0.0635 0.05 -0.9095 -0.6607 152.98 <.0001

JacobSimonsen · ‎02-22-2015

As I see it, Rick's approach by adding the line estimate 'Intercept' int 1 / EXP ; in proc genmod is the easiest way to get the estimate on the probability scale. It works both with log-link and log-link. Only difference is that the confidence interval is slightly difference. The other suggested solutions also works, but I see no reason to make it more complicated.

JacobSimonsen · ‎02-06-2015

Small changes are made: it was requested that the macro should be able to hande weights. So this is now added to an option to the coxaggregate-macro and also added to one of the examples in the article. Further, some more description is added in the top of the macro to better explain how it should be used.

Online Status	Offline
Date Last Visited	‎09-19-2025 09:18 AM

Re: Marginal likelihood function of a fragile survival model and the L...

Re: Train and test split for the proc phreg command

Re: comparing disease rate of a population with a disease rate for a s...

Re: Counting Process and Interactions

Re: Counting Process and Interactions

Re: How does the covariance matrix gets the confidence intervals of re...

Re: Interaction term in the PHREG procedure

Re: Interaction term in the PHREG procedure

Re: Proc Phreg for time varying exposure variable- different output

Re: Cox regression vs Poisson regression for analysis

Re: Why I always get 3:4:5

Re: Modelling a binary data using PROC GENMOD via logit link or linear...

Juletip #12 - How to count days across bank holidays using custom inte...

Re: Pooled Odds Ratio

Re: how I change the title in proc template to show a label of a varia...

Re: Marginal likelihood function of a fragile survival model and the L...

Re: Train and test split for the proc phreg command

Re: Counting Process and Interactions

Re: How does the covariance matrix gets the confidence intervals of re...

Re: Interaction term in the PHREG procedure

A method to speed up PROC PHREG when doing a Cox regression

Estimating additive and multiplicative parameters in a semiparametric ...

Re: A method to speed up PROC PHREG when doing a Cox regression

Re: Proc GENMOD Class question

Re: How to interpret these results !! Hazard Ratio

Re: How to interpret these results !! Hazard Ratio

Re: IRRs in proc genmod

Re: Force a regression coefficient to be negative

Re: Force a regression coefficient to be negative

Re: IRRs in proc genmod

Re: Cox continuous time-varying intervals

Re: Cox continuous time-varying intervals

Re: Cox continuous time-varying intervals

Re: Help with computing cumulative incidence prediction from competing...

Re: PHREG log transformation

Re: Binomial regression model with genmod

Re: Binomial regression model with genmod

Re: A method to speed up PROC PHREG when doing a Cox regression

SAS Community Nordic