
Survival Data Mining by Example in SAS® Enterprise Miner™

Started 10-21-2015
Modified 11-30-2015

 

Download the Files (GitHub)

 

This tip is part of the Learn by Example with SAS® Enterprise Miner™ Templates series where a new data mining topic is introduced and explained with one or more example SAS Enterprise Miner process flow diagrams.

 

When you have a time-dependent outcome that you are trying to model—a failure of some sort or customer churn, for example—you might be interested in predicting when, not if, the event is most likely to occur.  In SAS Enterprise Miner, a discrete-time logistic-hazard model is used to perform survival data mining.  This approach allows you to model the event likelihood over time, taking into account censored observations, competing risks, time-varying covariates, and left truncation.  The hazard function produced by the model can be estimated at a future time interval of interest to answer questions like:

 

  • What is the probability a customer will churn by a certain date?
  • Which customers are most likely to churn in the next 3 months?
  • What is the expected remaining time for a customer?
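All three questions follow from the discrete hazard. As a minimal sketch in plain Python (not SAS Enterprise Miner code; the hazard values and function names are made up for illustration), suppose the model has produced a hazard for each future interval, meaning the probability that the event occurs in that interval given survival up to it. Survival through interval t is then the product of (1 - hazard) over the first t intervals, the probability of churn by t is 1 minus that product, and summing the survival probabilities gives an expected remaining time capped at the forecast horizon:

```python
from itertools import accumulate
from operator import mul


def survival_curve(hazards):
    """Return S(1..T): probability of surviving through each interval."""
    return list(accumulate((1.0 - h for h in hazards), mul))


def prob_event_by(hazards, t):
    """Probability the event occurs in one of the first t intervals."""
    return 1.0 - survival_curve(hazards)[t - 1]


def restricted_mean_remaining(hazards):
    """Expected number of further intervals survived, capped at the horizon."""
    return sum(survival_curve(hazards))


# Hypothetical monthly hazards for two customers (illustrative values only).
customers = {
    "A": [0.10, 0.10, 0.10],
    "B": [0.02, 0.05, 0.20],
}

# Rank customers by probability of churning within the next 3 months.
ranked = sorted(customers, key=lambda c: prob_event_by(customers[c], 3), reverse=True)
```

Because the sum stops at the last forecast interval, `restricted_mean_remaining` understates the true expected remaining time whenever survival at the horizon is still well above zero.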

To get started with survival data mining using SAS Enterprise Miner, download the process flow diagrams (XML files) and the accompanying PDF documentation for the following two examples from the GitHub repository at https://github.com/sassoftware/dm-flow/tree/master/SurvivalAnalysis.

 

  1. Survival: An example that shows the basic use of the Survival node and how different values for the Time Interval property can affect the results

 


 

  2. SurvivalTVC: An example demonstrating how to include time-varying covariates in your data when modeling with the Survival node.
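To make the idea of time-varying covariates concrete, here is a small sketch in plain Python (not SAS code, and not necessarily the data layout the Survival node expects; the field names `customer_id`, `tenure`, `event`, and `monthly_usage` are hypothetical). A discrete-time hazard model is typically fit on expanded data with one row per subject per interval at risk, so a covariate can take a different value in each row. The target is 1 only in the interval where the event occurs, and censored subjects contribute rows with target 0 throughout:

```python
def expand_person_period(subject):
    """Expand one subject into one row per interval at risk.

    subject: dict with
      customer_id   - identifier
      tenure        - number of intervals observed
      event         - 1 if the event occurred in the last interval, 0 if censored
      monthly_usage - list of covariate values, one per observed interval
    """
    rows = []
    for t in range(1, subject["tenure"] + 1):
        is_last = t == subject["tenure"]
        rows.append({
            "customer_id": subject["customer_id"],
            "interval": t,
            "monthly_usage": subject["monthly_usage"][t - 1],  # value in interval t
            "target": 1 if (is_last and subject["event"] == 1) else 0,
        })
    return rows


# Illustrative records only: one churner and one censored customer.
subjects = [
    {"customer_id": 1, "tenure": 3, "event": 1, "monthly_usage": [40, 25, 5]},
    {"customer_id": 2, "tenure": 2, "event": 0, "monthly_usage": [30, 32]},
]

person_period = [row for s in subjects for row in expand_person_period(s)]
```

A logistic regression of `target` on `interval` and the covariates over rows like these estimates the hazard for each interval. A coarser time interval produces fewer rows per subject, which is one reason the choice of interval can change the results.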


 

To run these examples, refer to the README file that is part of the GitHub repository at https://github.com/sassoftware/dm-flow. Please note that these examples were tested with SAS Enterprise Miner 13.2.

 

You can find more information about the Survival node in these videos:

 

Introduction to Survival Data Mining 

 

 

New Features in the SAS Enterprise Miner 12.3 Survival Node

 

 

Comments

I keep getting "Must use one class target" in the Survival node.

 

I have 1 ID, 2 Time IDs, 1 Target, and 2 Inputs.

 

The target must have Level=Nominal (set in the Input Data node or the Metadata node), with 0 representing censored observations.

Thank you - I didn't get that out of any of my reading.

 

How does the system calculate the numbers in the "Summary of the Number of Censored and Uncensored Values" section of the results for SurvivalTVC? The numbers shown in that section were Failed=1406 and Censored=2292, but the actual counts that I calculated were event(0)=2214, event(1)=1370, and event(2)=114. Why is that?

Version history
Last update:
‎11-30-2015 02:55 PM