What's the most efficient way to count the no. of distinct values of a particular variable in SAS EG? Google brings up the Data Exploration tool, but it seems to only work for datasets that aren't already in the SAS project. The dataset that I want to count is in the WORK library and is a product of a table join by the query builder.
Thank you!
If you only want EG then I suggest the summarize task twice. First time will get you the different counts of each level, the second will count the number of levels.
A different way is through a QUERY using count Distinct as a calculated variable.
Proc Freq nlevels option is the fastest that I know.
Is that available as a task in EG? I meant a method specific to the UI in EG, if possible.
If you only want EG then I suggest the summarize task twice. First time will get you the different counts of each level, the second will count the number of levels.
A different way is through a QUERY using count Distinct as a calculated variable.
There is a custom task that you can download. It calculates the cardinality -- or number of distinct levels -- of each variable in your data set.
LINK: A custom task to check your data cardinality - The SAS Dummy
The UI looks like this:
Chris
Thank you all. I did not want to install the custom task as sometimes I work on a machine that isn't mine. But COUNT DISTINCT works well.
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
What’s the difference between SAS Enterprise Guide and SAS Studio? How are they similar? Just ask SAS’ Danny Modlin.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.