While I've used EM/Scorecard to recreate scorecards (I'm in model validation), I've never had to select input variables from a large number of potential inputs. I tried using Interactive Grouping to determine the inputs with the highest IVs, but the process choked on the number of input variables (1,400). What are the standard approaches? In the data I can identify families of inputs.
Any suggestions are welcome. Thank you. -- George Rezek
If I were you, I would use PROC HPGENSELECT or PROC PLS to pick the 30-40 most important variables, and then group those 30-40 variables with the Scorecard node.
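For anyone who wants to pre-screen on IV outside of Enterprise Miner before handing a shortlist to the grouping step, here is a minimal sketch in plain Python. It is only an illustration, not what Interactive Grouping does internally: the function names `information_value` and `rank_by_iv` are made up for this example, the binning is simple equal-frequency binning, the 0.5 smoothing constant is an arbitrary choice to avoid taking the log of zero, and the target is assumed to be coded 0 (good) / 1 (bad).

```python
import bisect
import math


def information_value(values, targets, n_bins=5, eps=0.5):
    """Approximate IV of one numeric input against a 0/1 target.

    Assumptions (not taken from the thread): equal-frequency bins,
    additive smoothing `eps` per bin, targets coded 0 = good, 1 = bad.
    """
    ordered = sorted(values)
    n = len(ordered)
    # Interior cut points at equal-frequency positions; tied cut points collapse.
    cuts = sorted({ordered[(i * n) // n_bins] for i in range(1, n_bins)})

    # counts[bin] = [number of goods, number of bads]
    counts = {}
    for v, t in zip(values, targets):
        b = bisect.bisect_right(cuts, v)
        counts.setdefault(b, [0, 0])[t] += 1

    k = len(counts)  # only non-empty bins are used
    total_good = sum(c[0] for c in counts.values())
    total_bad = sum(c[1] for c in counts.values())

    iv = 0.0
    for good, bad in counts.values():
        pct_good = (good + eps) / (total_good + eps * k)
        pct_bad = (bad + eps) / (total_bad + eps * k)
        # Each term is (share of goods - share of bads) * weight of evidence.
        iv += (pct_good - pct_bad) * math.log(pct_good / pct_bad)
    return iv


def rank_by_iv(columns, targets, top_k=40):
    """Return the top_k (name, IV) pairs, highest IV first.

    `columns` maps a variable name to its list of numeric values.
    """
    scores = {name: information_value(vals, targets) for name, vals in columns.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
```

Calling `rank_by_iv(columns, targets, top_k=40)` on a dictionary of the 1,400 inputs would give a shortlist of roughly the size suggested above. Missing values and character inputs are not handled here and would need their own bins.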