Hi. I'm a first-time poster and a newbie in the world of data mining. To be honest, I'm working on an assessment that requires me to produce different decision trees in SAS Enterprise Miner by trying different configurations. I'm also quite new to Enterprise Miner and have only gone through the Intro to SAS Enterprise Miner PDF.

My dataset (an imported CSV) consists of bank data (e.g. age, education, job, housing loan, personal loan, consumer index, previous outcome of marketing campaign, etc.) and is intended to predict whether a client will sign up for a loan or not, so I have a signup class with yes and no as the expected result. I noticed that in my whole dataset, which is around 40k rows, most of the results are No; only around 10% are Yes. Based on my reading, this is called an unbalanced data set, and I won't get a good model out of it. I did some more reading and found that I need to undersample the "No" results in order to overcome this overrepresentation.

I raised this with my professor and asked him how I can do it in Enterprise Miner, but he just ignored me, so I'm left to do it on my own. My question is: how do I go about this using SAS Enterprise Miner 13.1?