BookmarkSubscribeRSS Feed
deleted_user
Not applicable
I am trying to predict a rare event, I read about using oversampling with the sampling node both on the following link and on EM's Help.
http://support.sas.com/kb/24/205.html

The link says that I'm not supposed to adjust frecuency for oversampling but EM's help says I should. My intention is to make a model and then score a large database with the resulting model, Should I adjust the frecuency for oversampling or not?

I tried both approaches, the cumulative lift and even some of the resulting independent variables are very different.
1 REPLY 1
Karsten_SAS
SAS Employee
Hi,

what Enterprise Miner version are you using? In Enterprise Miner 5.x, do not select the "adjust frequency for oversampling" check box as it offsets the level-based sampling / over-sampling. To my mind, you can either use the level-based sampling approach to over-sampling OR the adjust frequency approach to over-sampling. I use diagrams like this one in EM 5.3:

Input Data Source (_without_ a target profile)
>
Sample Node (with level-based sampling, no frequency adjustment)
>
Decision node (create an appropriate target profile to reflect the true priors)
>
[...]

Cheers,
Karsten

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!

Register now

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 1956 views
  • 0 likes
  • 2 in conversation