I'm running frequencies on variables that have thousands of distinct values, but I'm only interested in the most frequently occurring ones. How can I limit the output to codes that make up, say, 1% or more of the total? That is, in the frequency output table, the "Percent" column should be 1.00 or greater. I'm currently using the following syntax to print a fixed number of observations, but that approach sometimes leaves out values I want to look at, or includes values that occur too infrequently to bother with.
PROC FREQ NOPRINT ORDER=FREQ;
TABLES DX_code / OUT=medicaid.countout_grp3;
RUN;
PROC PRINT DATA=medicaid.countout_grp3(OBS=100);
RUN;
The OUT= data set from PROC FREQ contains the variables COUNT and PERCENT, so you can filter on PERCENT when you print it:
proc print data=medicaid.countout_grp3;
where percent>=1;
run;
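If you would rather not store the low-frequency codes at all, the same condition can be applied as a WHERE= data set option on the OUT= data set, so only the qualifying rows are written. The following is a minimal sketch, not a definitive solution: the input data set name `medicaid.claims` and the output name `work.dx_top` are hypothetical placeholders, and it assumes PERCENT is computed over the nonmissing values of DX_code, which is the default.

```sas
/* Sketch: keep only codes that account for at least 1% of the total. */
/* medicaid.claims and work.dx_top are hypothetical data set names.   */
proc freq data=medicaid.claims noprint order=freq;
  /* WHERE= filters rows as the output data set is written */
  tables DX_code / out=work.dx_top(where=(percent >= 1));
run;

/* Print the surviving codes, most frequent first */
proc print data=work.dx_top noobs;
  var DX_code count percent;
  format percent 6.2;
run;
```

Changing the threshold then only means editing the one `percent >= 1` condition, and the saved data set stays small even when the variable has thousands of distinct values.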
Simple and effective! Thanks!