BookmarkSubscribeRSS Feed
PetePatel
Quartz | Level 8

Hi,

 

I have a large dataset (c.350k records( with unique identifier (ID), target and c. 1000 variables for which I need to calculate an information value.

 

Ideally, I would like to get a list of the variables with WOE and information value attached so I can start filtering.

 

Does anybody have any code that will allow me to do this efficiently in SAS?

 

Thanks

2 REPLIES 2
Ksharp
Super User

I could suggest using PROC HPGENSELECT or PROC PLS to select the most 30 significant variables,

and get these 30 vaiables's WOE and IV. 

If you want get better WOE and maximize IV to let your Score Card better and stronger ,

I wrote a paper about it , but you need SAS/IML and would cost you many time.

 

If you have SAS/EM's  Score Card node ,that is your first choice.

 

"Get Better Weight of Evidence for Scorecards Using a Genetic Algorithm"
https://www.sas.com/content/dam/SAS/support/en/sas-global-forum-proceedings/2018/1808-2018.pdf

sas-innovate-2026-white.png



April 27 – 30 | Gaylord Texan | Grapevine, Texas

Registration is open

Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and save with the early bird rate—just $795!

Register now

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1710 views
  • 0 likes
  • 3 in conversation