turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- Home
- /
- Analytics
- /
- Data Mining
- /
- Usage of WoE or Grouping Variables In Regression N...

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

05-09-2014 07:33 PM

Hello, I am new in SAS Miner and I am trying to understand the following:

1)Why when I use an interactive group node I have as an output both Grouping and WoE variables?

2)Why the Regression Node uses both group and woe variables in the final model?

I am trying to do a coarse classification and then a logistic regression.

I am attaching a screenshot of the project, Interactive Group node & Regression Node settings and the output of the regression. Am I doing something wrong?

Thank you in advance!

Accepted Solutions

Solution

05-13-2014
10:43 AM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

Posted in reply to SIlver_Geo

05-13-2014 10:43 AM

Hi Silver Geo,

If I understand the Interactive Grouping and Scorecard nodes correctly,

1) You get both the groupings and the weights of evidence in case you have a preference to fit a logistic regression using either of them. Notice that the fourth train property in the Scorecard node is Analysis Variables and you can specify it as WOE (by defualt) or Group.

Most SAS Enterprise Miner customers would use the WOE.

2) The Scorecard node is set to automatically use either the WOE or Groups. This is not the case with the Regression node. The Regression node gives you the flexibility of additional options like the type of regression and the link function, but you have to also take control of the inputs that you use. A case where you might like to take this approach would be when you are interested in a linear regression, but you would have to use either the WOE or the Group variables as inputs.

For your coarse classification and logistic regression, I would advise using the Interactive Grouping node and the Scorecard node. These two nodes are well suited for logistic regression and you also get a scorecard format with points that are easy to interpret. By default, every 20 points the odds of an event double.

I hope it helps,

Miguel

All Replies

Solution

05-13-2014
10:43 AM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

Posted in reply to SIlver_Geo

05-13-2014 10:43 AM

Hi Silver Geo,

If I understand the Interactive Grouping and Scorecard nodes correctly,

1) You get both the groupings and the weights of evidence in case you have a preference to fit a logistic regression using either of them. Notice that the fourth train property in the Scorecard node is Analysis Variables and you can specify it as WOE (by defualt) or Group.

Most SAS Enterprise Miner customers would use the WOE.

2) The Scorecard node is set to automatically use either the WOE or Groups. This is not the case with the Regression node. The Regression node gives you the flexibility of additional options like the type of regression and the link function, but you have to also take control of the inputs that you use. A case where you might like to take this approach would be when you are interested in a linear regression, but you would have to use either the WOE or the Group variables as inputs.

For your coarse classification and logistic regression, I would advise using the Interactive Grouping node and the Scorecard node. These two nodes are well suited for logistic regression and you also get a scorecard format with points that are easy to interpret. By default, every 20 points the odds of an event double.

I hope it helps,

Miguel

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

Posted in reply to M_Maldonado

05-13-2014 11:18 AM

Dear Miguel,

Thank you for your assistance, I made quite an extensive reading and I was able to understand the whole process. I used in the end only WoE variables since in behavioural credit scoring can make more sense and show the risk for every variable.

Best Regards,

George