- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
Posted 04-01-2017 08:17 AM
(1339 views)
Hi All.
I have a data file which has many columns.
A lot of variable have 0 as values.
I need to find which are the top variables that will be problematic because of the presence of a lot of 0’s?
Thanks & Regards
Vishyy
2 REPLIES 2
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
make use of proc frequency for this. you will hv 2 go thru the syntax ol. this proc will answer your questn.
- Mark as New
- Bookmark
- Subscribe
- Mute
- RSS Feed
- Permalink
- Report Inappropriate Content
It will be easier if you create a format to group other values. For example:
proc format;
value posneg low-<0 = 'Negative' 0='Zero' 0-high='Positive';
run;
proc freq data=have;
tables _numeric_ / missing;
format _numeric_ posneg.;
run;
That way, you won't have to sort through huge tables with potentially thousands of different values for each variable.