BookmarkSubscribeRSS Feed
omerzeybek
Obsidian | Level 7

hello

i am suspicious that some of my inputs are nearly same with my target variable. Because in desicion tree process these inputs receives considerably high logworth stats.

do you know how can i check correlations between my target and inputs?

5 REPLIES 5
M_Maldonado
Barite | Level 11

Hi Omer,

I think what you need is to calculate VIF to assess multicollinearity. HPReg node this automatically for you. If you are working with an older EM version you can also write some code for proc reg. See https://communities.sas.com/message/188233

Since you are talking about trees, they are useful to detect 2-way interactions. After your first split, what variable is selected for the largest tree of maximum depth 2?

Do you get similar results from both VIF and a tree to detect 2-way interactions?

I hope that helps,

Miguel

omerzeybek
Obsidian | Level 7

Hi Miguel

unfortunately i am working with a sas miner screen without a hpreG node. On the other i dont know how to write code in saas miner. By the way i have read the message you metioned

WendyCzika
SAS Employee

You can use the StatExplore node (on the Explore tab) to calculate correlations between an interval target and interval inputs.

omerzeybek
Obsidian | Level 7

Thanks Wendy

but i have a binary target variable is it also applicable for this kind of errors too

WendyCzika
SAS Employee

Yes in that case you can use the Chi-Square Statistic reported in StatExplore.  Usually it is used in the opposite way though - you want to keep the inputs associated with the target Smiley Happy 

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 2878 views
  • 3 likes
  • 3 in conversation