BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
SIlver_Geo
Calcite | Level 5

Hello, I am new in SAS Miner and I am trying to understand the following:

1)Why when I use an interactive group node I have as an output both Grouping and WoE variables?

2)Why the Regression Node uses both group and woe variables in the final model?

I am trying to do a coarse classification and then a logistic regression.

I am attaching a screenshot of the project, Interactive Group node & Regression Node settings and the output of the regression. Am I doing something wrong?

Thank you in advance!


Screen Shot 2014-05-10 at 00.26.21.pngScreen Shot 2014-05-10 at 00.24.54.pngScreen Shot 2014-05-10 at 00.26.44.png
1 ACCEPTED SOLUTION

Accepted Solutions
M_Maldonado
Barite | Level 11

Hi Silver Geo,

If I understand the Interactive Grouping and Scorecard nodes correctly,

1) You get both the groupings and the weights of evidence in case you have a preference to fit a logistic regression using either of them. Notice that the fourth train property in the Scorecard node is Analysis Variables and you can specify it as WOE (by defualt) or Group.

Most SAS Enterprise Miner customers would use the WOE.

2) The Scorecard node is set to automatically use either the WOE or Groups. This is not the case with the Regression node. The Regression node gives you the flexibility of additional options like the type of regression and the link function, but you have to also take control of the inputs that you use. A case where you might like to take this approach would be when you are interested in a linear regression, but you would have to use either the WOE or the Group variables as inputs.

For your coarse classification and logistic regression, I would advise using the Interactive Grouping node and the Scorecard node. These two nodes are well suited for logistic regression and you also get a scorecard format with points that are easy to interpret. By default, every 20 points the odds of an event double.

I hope it helps,

Miguel

View solution in original post

2 REPLIES 2
M_Maldonado
Barite | Level 11

Hi Silver Geo,

If I understand the Interactive Grouping and Scorecard nodes correctly,

1) You get both the groupings and the weights of evidence in case you have a preference to fit a logistic regression using either of them. Notice that the fourth train property in the Scorecard node is Analysis Variables and you can specify it as WOE (by defualt) or Group.

Most SAS Enterprise Miner customers would use the WOE.

2) The Scorecard node is set to automatically use either the WOE or Groups. This is not the case with the Regression node. The Regression node gives you the flexibility of additional options like the type of regression and the link function, but you have to also take control of the inputs that you use. A case where you might like to take this approach would be when you are interested in a linear regression, but you would have to use either the WOE or the Group variables as inputs.

For your coarse classification and logistic regression, I would advise using the Interactive Grouping node and the Scorecard node. These two nodes are well suited for logistic regression and you also get a scorecard format with points that are easy to interpret. By default, every 20 points the odds of an event double.

I hope it helps,

Miguel

SIlver_Geo
Calcite | Level 5

Dear Miguel,

Thank you for your assistance, I made quite an extensive reading and I was able to understand the whole process. I used in the end only WoE variables since in behavioural credit scoring can make more sense and show the risk for every variable.

Best Regards,

George

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 4708 views
  • 0 likes
  • 2 in conversation