Hello everyone, I have always found the SAS community very useful. For the first time, I have not found what I am looking for, so here I am, posting for the first time. 🙂

I am working on a classification task for marketing using SAS Enterprise Miner (latest version). I have 30 variables and I must predict whether the customer will accept or refuse our next direct marketing offer. Besides the target variable and the socio-demographic and firmographic variables, I have 5 binary variables. Each of these binary variables represents whether the customer responded to one of the previous marketing offers (campaign 1 to campaign 5).

I want to understand the correlation among these five binary variables and, eventually, the worth of this binary vector in predicting the target variable. After some research, I found that the best candidates are the Phi coefficient (using PROC CORR PEARSON on the binary variables) and the tetrachoric correlation (the special case of the polychoric correlation for binary variables).

With the tetrachoric correlation I obtain a much higher correlation than with Phi. Do you know why? In this context, what is, in your experience, the best correlation measure?

Thank you very much and enjoy your Easter.
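For concreteness, here is a minimal Python sketch (not SAS) of the two measures on a single 2x2 table of two campaign-response flags. The counts are made up and the function names are hypothetical. The tetrachoric value uses the simple cosine-pi approximation, so it will not match exactly what an iterative estimation procedure reports.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi: the Pearson correlation of two 0/1 variables, from 2x2 counts.

    a = both 1, b = first 1 / second 0, c = first 0 / second 1, d = both 0.
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    return (a * d - b * c) / denom


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric correlation.

    Assumes b and c are non-zero; this is an approximation only.
    """
    odds_ratio = (a * d) / (b * c)
    return math.cos(math.pi / (1 + math.sqrt(odds_ratio)))


# Hypothetical counts for two campaign-response flags (illustration only).
a, b, c, d = 40, 10, 10, 40

phi = phi_coefficient(a, b, c, d)
tet = tetrachoric_cos_pi(a, b, c, d)

print(f"phi         = {phi:.3f}")
print(f"tetrachoric = {tet:.3f}  (cosine-pi approximation)")
```

On these made-up counts the tetrachoric approximation is larger than Phi, which is the same pattern described above: Phi correlates the observed 0/1 values, while the tetrachoric correlation estimates the correlation of assumed underlying continuous, normally distributed variables.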