BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
JinboZhao
Calcite | Level 5

Hi,

 

I used SAS Enterprise Miner to generate a scorecard. For one variable in the scorecard, DAYS, it has all the discrete integer values from 1 to 31. It's WOE is like the follow table:

ValueGroupCutoffEventCountNon Event CountTotalEvent RateWOE 
_MISSING_1 00000
_UNKNOWN_1 00000
11 6931438360.8290.67349
82 188282160.870.34743
133 767830.916-0.1332
223 142111530.928-0.3063
154 267152820.947-0.6275
164 13051350.963-1.0064
195 923950.968-1.1715
235 12321250.984-1.8674
295 811820.988-2.1428

When I used another dataset to run the Interactive Grouping node, I got a continuous groups for the same DAYS variables as in the following table. Those two datasets are using exactly the same variables, only different values, because they represent two time's data. 

ValueGroupCutoffEventCountNon Event CountTotalEvent RateWOE 
_MISSING_1 00000
_UNKNOWN_1 179222010.8910.28689
1<=DAYS<71 179141930.927-0.1651
7<=DAYS<102 19592040.956-0.6925
10<=DAYS<253 172141860.925-0.1252
25<=DAYS<314 11871250.944

-0.4415

 

I do not want this value showing in interval groups. I need them in descrete values. I tried to split those 5 old interval groups into 31 interval groups, so that each group only has one integer value. But the problem is, Miner assign new groups with a numerical order. Like, less than 7 in group 1, DAYS between (7, 10) in group 2, but DAYS>21 (DAYS=21) can not be assign to group 1 again.

 

This is not what I want. Is there method to change it to the discrete values group, like in the first example? 

 

Thank you. 

 

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User
I'm assuming you're talking about the first column, VALUE? If so, how are you importing the data into EM? That's usually where you change the types/settings. I would also check my source data to ensure my assumption that its discrete is true.

View solution in original post

5 REPLIES 5
Reeza
Super User
I'm assuming you're talking about the first column, VALUE? If so, how are you importing the data into EM? That's usually where you change the types/settings. I would also check my source data to ensure my assumption that its discrete is true.
JinboZhao
Calcite | Level 5

Thank you. You are right, in the first one, DAYS is Nominal level variable, but in the second one, it is the Interval level variable. 

Could you tell me how to change the variable's Level? 

Reeza
Super User

Personally, I would fix this type of stuff before working in EM. I find exploratory analysis and recoding variables a pain in EM. 

 

Also, you need to define the runs. I do believe there's a binning or recoding task in EM that can allow you to recode this however. 

JinboZhao
Calcite | Level 5

Thank you again. While, could you give some possible ways to fix it in SAS base or SAS  E-Guide? I really have no clue, becuase this variable is alreay integer in the original dataset. 

Reeza
Super User

@JinboZhao wrote:

 I really have no clue, becuase this variable is alreay integer in the original dataset. 


If this is the case, check the variable role. You can usually change that via the GUI to discrete. 

 

Your previous response implied it was different in the source data.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 1205 views
  • 0 likes
  • 2 in conversation