I'm running frequencies of variables that have thousands of different values. But I'm only interested in the most frequently-occurring. How can I limit the output to codes that make up, say, 1% or more of the total? That is, in the frequency output table, the "Percent" column should be 1.00 or greater. I'm currently using the following syntax to limit to a specified number of observations, but sometimes that approach leaves out values that I want to look at or include values that occur too infrequently to be bothered with.
PROC FREQ noprint ORDER=FREQ;
TABLES DX_code /out=medicaid.countout_grp3;
PROC PRINT DATA = medicaid.countout_grp3(obs=100);
RUN;
You can use variable PERCENT in the output dataset:
proc print data=medicaid.countout_grp3;
where percent>=1;
run;
You can use variable PERCENT in the output dataset:
proc print data=medicaid.countout_grp3;
where percent>=1;
run;
Simple and effective! Thanks!
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.