Download the Files (GitHub)
This tip is part of the Learn by Example with SAS® Enterprise Miner™ Templates series where a new data mining topic is introduced and explained with one or more example SAS Enterprise Miner process flow diagrams.
When you have a time-dependent outcome that you are trying to model—a failure of some sort or customer churn, for example—you might be interested in predicting when, not if, the event is most likely to occur. In SAS Enterprise Miner, a discrete-time logistic-hazard model is used to perform survival data mining. This approach allows you to model the event likelihood over time, taking into account censored observations, competing risks, time-varying covariates, and left truncation. The hazard function produced by the model can be estimated at a future time interval of interest to answer questions like:
What is the probability a customer will churn by a certain date?
Which customers are most likely to churn in the next 3 months?
What is the expected remaining time for a customer?
To get started with survival data mining using SAS Enterprise Miner, download the process flow diagrams (XML files) and the accompanying PDF documentation for the following two examples from the GitHub repository at https://github.com/sassoftware/dm-flow/tree/master/SurvivalAnalysis.
Survival: An example that shows the basic use of the Survival node and how different values for the Time Interval property can affect the results
SurvivalTVC: An example demonstrating how to include time-varying covariates in your data when modeling with the Survival node.
To run these examples, refer to the README file that is part of the GitHub repository at https://github.com/sassoftware/dm-flow. Please note that these examples were tested with SAS Enterprise Miner 13.2.
You can find more information about the Survival node in these videos:
Introduction to Survival Data Mining
New Features in the SAS Enterprise Miner 12.3 Survival Node
... View more