04-01-2017 08:17 AM
I have a data file which has many columns.
A lot of variable have 0 as values.
I need to find which are the top variables that will be problematic because of the presence of a lot of 0’s?
Thanks & Regards
04-01-2017 08:54 AM
It will be easier if you create a format to group other values. For example:
value posneg low-<0 = 'Negative' 0='Zero' 0-high='Positive';
proc freq data=have;
tables _numeric_ / missing;
format _numeric_ posneg.;
That way, you won't have to sort through huge tables with potentially thousands of different values for each variable.