Hi all, I am trying to find an alternative way of checking the diversity in values of a variable. My goal is to set a measurement or weight that will indicate how well the variable is differentiated in its values and isn't characterized of lets 70% or 80 % of the same value. (another example would that variable X has 503 distinct values in 2000obs which i guess is good) My goal is to select based on that measure variables for segmentation modeling , cause i believe they can discriminate my data well. I am looking for something besides Proc univariate, means for stats / Varclus or PCA for variable selection, any idea? Thank you in advance
... View more