turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- Home
- /
- Analytics
- /
- Data Mining
- /
- Oversampling in EMiner

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

12-16-2016 10:17 AM - edited 12-16-2016 11:16 AM

Quite obvious my target event is rare, so I put a decision node to adjust the priors, Original proporition was 0.04/0.96 after adjust ment its 0.3/0.7. the decision tab on the decision node is 1.0 - 1.0 at digonal positions.

When I am scoring a test data the probabilites of the event that I am getting is around 0.3(event=1) and 0.7(event=0) I thougth EM do the adjustments for the probabilities and expected probabilites near 0.04 and 0.96. Is there anything I have to do to get the probilities adjusted, as I would get without the decision node?

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

Posted in reply to MagicalEmerald

12-16-2016 03:39 PM

If what you are saying is the proportion in your sample was originally 0.04/0.96, then you oversampled to get 0.3/0.7, then in a Decisions node, you should have entered the original prior probabilities as the Adjusted Priors. Then if you look in the score code in the Score node, you should see lines like this doing the adjustment of the posterior probabilities:

*** Update Posterior Probabilities;

P_BAD1 = P_BAD1 * 0.04 / 0.3;

P_BAD0 = P_BAD0 * 0.96 / 0.7;

drop _sum; _sum = P_BAD1 + P_BAD0 ;