husseinmazaar
Quartz | Level 8

Dear Consultants,

I have a dataset (599 × 14,747): 599 observations across 6 classes, with 14,746 independent variables (predictors). The predictors are interval and the target (response) variable is nominal (categorical). I tried using Pearson correlation to screen out irrelevant variables, but I think it captures only linear relationships, and Pearson coefficients are also sensitive to outliers. I would like to know the best way to measure variable importance using statistical and data mining methods in order to improve the classification. Note: I have SAS Enterprise Miner 13.2.

Accepted Solution
M_Maldonado
Barite | Level 11

Hi Hussein,

I remember we were discussing techniques for variable selection and dimension reduction in an earlier thread.

Do some of those techniques give you a better model than others?

Different techniques have different pros and cons. Many times you have to try a lot of them to see which one is most helpful to your data and business problem.
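As one illustration of a filter you could add to that comparison, here is a minimal sketch of a rank-based screen: it scores each interval predictor against the nominal target with the Kruskal-Wallis H statistic, which works on ranks and so is much less affected by outliers than Pearson correlation. This is plain Python rather than Enterprise Miner code, and the helper names (`average_ranks`, `kruskal_wallis_h`, `rank_predictors`) and the toy data are made up for the example. It is only one candidate technique, not a recommendation over the others.

```python
from collections import defaultdict


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(values, labels):
    """Tie-corrected Kruskal-Wallis H for one predictor vs. a nominal target."""
    n = len(values)
    if n < 2:
        return 0.0
    ranks = average_ranks(values)
    rank_sums = defaultdict(float)
    counts = defaultdict(int)
    for r, lab in zip(ranks, labels):
        rank_sums[lab] += r
        counts[lab] += 1
    h = 12.0 / (n * (n + 1)) * sum(
        rank_sums[g] ** 2 / counts[g] for g in counts
    ) - 3 * (n + 1)
    # Standard correction for tied predictor values.
    tie_counts = defaultdict(int)
    for v in values:
        tie_counts[v] += 1
    correction = 1 - sum(t ** 3 - t for t in tie_counts.values()) / (n ** 3 - n)
    return h / correction if correction > 0 else 0.0


def rank_predictors(columns, labels):
    """Sort predictor names by H, largest (most class-separating) first."""
    scores = {name: kruskal_wallis_h(vals, labels) for name, vals in columns.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data (hypothetical): 9 observations, 3 classes, 2 predictors.
labels = ["a", "a", "a", "b", "b", "b", "c", "c", "c"]
columns = {
    # Separates the classes; the last value is a deliberate outlier.
    "x_informative": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 900.0],
    # Unrelated to the classes.
    "x_noise": [5.0, 1.0, 9.0, 2.0, 8.0, 4.0, 3.0, 7.0, 6.0],
}
ranking = rank_predictors(columns, labels)
for name, score in ranking:
    print(f"{name}: H = {score:.4f}")
```

Because the score depends only on ranks, the outlier in `x_informative` does not change its position in the ranking. With thousands of predictors, a univariate screen like this is best treated as a first pass: it looks at one variable at a time, so it will miss predictors that only matter in combination.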

The course Advanced Predictive Modeling Using SAS Enterprise Miner is a good hands-on refresher of all the tools you have in your toolkit. I am considering taking that class again.

Post some of your findings if you have a chance.

Thanks!

-Miguel

