ZBlue
Calcite | Level 5

Hi,

This is the problem I am trying to solve for a client, and I would appreciate some help:

1) I am trying to predict the "time to default" (along with the associated probability that a particular loan defaults in month 1, month 2, etc.) within a six-month period for a set of retail loans.

2) The independent variables are time-invariant loan characteristics, observed over the 12 months prior to default, e.g. loan amount, tenure, repayment history.

3) The nature of the data is such that the dependent variable (time to default) is observed only in discrete units, i.e. 1 month, 2 months, 3 months, etc.

After going through some of the available literature on survival analysis, I would prefer to use a suitable parametric method, e.g. LIFEREG (after determining the appropriate underlying distribution): firstly because the methodology and results are easy to interpret and explain, and secondly because prediction (for future loans) and validation appear to be simpler than with proportional hazards methods.

So my question is: what would be the best way to use parametric methods while allowing for a discrete-time dependent variable as above?

Thanks!

1 REPLY
Steelers_In_DC
Barite | Level 11

I'm sure you can get more help with the analysis part, but I'm wondering whether prepping the data would help. Does this get you any closer to where you want to be?

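/* Sample input: an observation date and the number of months to default */
data have;
    infile cards dsd;
    informat date mmddyy10.;
    format date mmddyy10.;
    input date time_to_default;
    cards;
10/31/2015,6
;
run;

/* Advance the date by time_to_default months; 'e' aligns to the month end */
data want;
    format default_date mmddyy10.;
    set have;
    default_date = intnx('month', date, time_to_default, 'e');
run;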
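On the analysis side, one possible way to keep a parametric model while respecting the monthly measurement is to treat each default as interval-censored: a default recorded in month k is only known to have happened somewhere between month k-1 and month k. PROC LIFEREG accepts this through the MODEL (lower, upper) syntax. The sketch below is only an illustration under assumptions: the data set, the variable names (loan_id, loan_amount, tenure, default_month) and the toy rows are all made up to show the layout, and the Weibull distribution is a placeholder rather than a recommendation. A handful of rows like this is far too few for a meaningful fit.

```sas
/* Hypothetical input: one row per loan. default_month is 1..6, or   */
/* missing if the loan did not default within the six-month window.  */
data loans;
    input loan_id loan_amount tenure default_month;
    datalines;
1 5000 12 3
2 12000 24 .
3 8000 36 1
4 15000 12 6
5 3000 24 .
6 9000 36 2
;
run;

/* Build the (lower, upper) bounds that PROC LIFEREG expects */
data intervals;
    set loans;
    if missing(default_month) then do;
        lower = 6; upper = .;    /* right-censored: no default by month 6 */
    end;
    else if default_month = 1 then do;
        lower = .; upper = 1;    /* left-censored: default by end of month 1 */
    end;
    else do;
        lower = default_month - 1;   /* default occurred after this month */
        upper = default_month;       /* ... and by the end of this month  */
    end;
run;

/* Interval-censored parametric fit; dist= is an assumption to be checked */
proc lifereg data=intervals;
    model (lower, upper) = loan_amount tenure / dist=weibull;
run;
```

Under the fitted distribution, the probability that a loan defaults in month k is F(k) - F(k-1), where F is the fitted cumulative distribution function for that loan's covariates, so the month-by-month probabilities follow from the same model.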
