grezek
Obsidian | Level 7

While I've used EM/Scorecard to recreate scorecards (I'm in model validation), I've never had to select input variables from a large number of candidates. I tried Interactive Grouping to determine the inputs with the highest IVs, but the process choked on the number of input variables (1,400). What are the standard approaches? In the data I can identify families of inputs.

 

Any suggestions are welcome.  Thank you.  -- George Rezek

Accepted Solution
Ksharp
Super User

If I were you, I would use PROC HPGENSELECT or PROC PLS to pick out the 30-40 most important variables, and then group those 30-40 variables with the ScoreCard node.
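
To make the "screen first, group later" idea concrete, here is a rough sketch in plain Python (outside SAS, for illustration only). It is not what PROC HPGENSELECT, PROC PLS, or the Interactive Grouping node do internally. It swaps in a simpler univariate screen: compute the information value (IV) of each numeric input against a 0/1 target and keep the top few. The function names `information_value` and `top_inputs_by_iv`, the equal-frequency binning, the 0.5 smoothing constant, and the synthetic data are all assumptions of this sketch, and it assumes numeric inputs with no missing values.

```python
import math
import random
from bisect import bisect_right


def information_value(values, targets, n_bins=10, smoothing=0.5):
    """IV of one numeric input against a 0/1 target (1 = bad, 0 = good)."""
    ordered = sorted(values)
    n = len(ordered)
    # Interior cut points at equal-frequency positions; tied cut points collapse.
    cuts = sorted({ordered[i * n // n_bins] for i in range(1, n_bins)})
    good = [0] * (len(cuts) + 1)
    bad = [0] * (len(cuts) + 1)
    for v, t in zip(values, targets):
        b = bisect_right(cuts, v)  # equal values always land in the same bin
        if t == 1:
            bad[b] += 1
        else:
            good[b] += 1
    # Ignore bins that received no observations at all.
    cells = [(g, b) for g, b in zip(good, bad) if g + b > 0]
    k = len(cells)
    total_good = sum(g for g, _ in cells)
    total_bad = sum(b for _, b in cells)
    iv = 0.0
    for g, b in cells:
        # Smoothing (an assumption of this sketch) avoids log(0) in pure bins.
        p_good = (g + smoothing) / (total_good + smoothing * k)
        p_bad = (b + smoothing) / (total_bad + smoothing * k)
        iv += (p_good - p_bad) * math.log(p_good / p_bad)
    return iv


def top_inputs_by_iv(columns, targets, k=40):
    """columns maps input name -> list of values; returns the k highest-IV (name, IV) pairs."""
    scored = [(name, information_value(vals, targets)) for name, vals in columns.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Synthetic demo data: one informative input among pure-noise inputs.
rng = random.Random(0)
y = [1 if rng.random() < 0.2 else 0 for _ in range(5000)]
inputs = {f"noise_{i}": [rng.gauss(0, 1) for _ in y] for i in range(20)}
inputs["signal"] = [t + rng.gauss(0, 1) for t in y]

for name, iv in top_inputs_by_iv(inputs, y, k=5):
    print(f"{name:10s} IV = {iv:.3f}")
```

A univariate screen like this looks at each input in isolation, so it can keep several near-duplicate inputs from the same family and can miss inputs that only matter jointly. If the families of inputs are known, one option is to run the ranking within each family and carry only the best few from each forward to grouping.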


grezek
Obsidian | Level 7
Thank you very much. I'll investigate and let you know how it goes. Thanks for the help. -- George
