Calcite | Level 5

I used a variable clustering node and a variable selection node to reduce redundant variables. However, I noticed that in the model comparison node, the random model scored best. Why?

Super User

Insufficient detail.

Define the criterion used to determine "best", share the data, and show the code used (or generated by the nodes) to build the models.


Then someone may be able to answer.

SAS Employee

As mentioned, we need more details to answer this question. As a best practice, I want to point you to a paper that covers several data mining topics. It is several years old, but it is a fantastic resource for optimizing your data mining analysis. It is called "Identifying and Overcoming Common Data Mining Mistakes". It is found at the following URL:

