Dear Colleagues,
I am inquiring to know if it is possible to execute a survival random forest analysis using SAS. I have been examining the PROC HPFOREST procedure, but I have not encountered any features related to survival analysis within this context.
I have come across a paper wherein survival was considered using a discrete-time logistic regression model.
Thank you in advance.
Best regards,
Iuri Leite
Since you "have come across a paper wherein survival was considered using a discrete-time logistic regression model." perhaps you can share a link to that paper so we have some clue what you may be contemplating.
@iuri_leite wrote:
Dear Colleagues,I am inquiring to know if it is possible to execute a survival random forest analysis using SAS. I have been examining the PROC HPFOREST procedure, but I have not encountered any features related to survival analysis within this context.
I have come across a paper wherein survival was considered using a discrete-time logistic regression model.
Thank you in advance.
Best regards,
Iuri Leite
Dear @ballardw ,
here is the link of the application of random survival forest using the discrete-time hazard model.
There is also this paper with an application using R.
https://www.sciencedirect.com/science/article/pii/S2589537021003138
Best regards,
Iuri Leite
Hello @iuri_leite ,
I think you're confusing two things.
I use discrete-time Logistic Hazard Regression regularly and you do that with PROC LOGISTIC or PROC LOGSELECT (or PROC GENMOD or PROC HPGENSELECT).
Powerful technique, ... versatile and flexible.
That Random-Forest Machine Learning technique is a different approach.
Let us know if you would like more information about this PROC LOGISTIC approach to survival analysis. There is a complete course on this subject at SAS of a few hundred pages.
Greet, Koen
Dear @sbxkoenk,
thanks for sending me an email.
I used the discrete-time hazards model few times (with logistic and log-log link), but I am not interested in using it. I am interested in using the random forest survival model for variable selection.
In the paper I posted, It seems to me that it was used a survival forest for a data structured for the implementation of a discrete-time logistic regression.
I will read the papers again.
Thanks a lot.
Regards,
Iuri Leite
Hello @iuri_leite ,
I initially thought you were talking about Random Survival Forests scikit recently released. That was purely technical variation.
The claim paper you provided shows typically how survival analysis has been done. As you see, there isn't any special invention on the HPFOREST procedure part to extend the model from logistic approaches. As in all survival analysis, heavy duty is in data preparation. If you have analysis data ready for logistic regression, it is ready for HPfOREST.
Jia
Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.