Is there a way in SAS EG to find duplicated values? Must have duplicated rows in CIN and YY columns but have distinct MM. And I need all three columns in the output. Example data below:
Thanks @LinusH. Just to confirm, did you mean that I should add CIN, YY, MM to the Select Data (columns) in the Query Builder, then use COUNT as a summary function for MM. Then add CIN, YY to the Group By. Add Count(*) >1 to HAVING clause under Filter tab?
If you want to this using UI rather than direct programming, try Query Builder, by defining CIN and YY as grouping variables, and with a having filter on count(*) > 1.
Thanks @LinusH. Just to confirm, did you mean that I should add CIN, YY, MM to the Select Data (columns) in the Query Builder, then use COUNT as a summary function for MM. Then add CIN, YY to the Group By. Add Count(*) >1 to HAVING clause under Filter tab?
I think so, I don't have access to EG right now so I can't verify.
But you should end up in a SQL similar to:
proc sql;
select cin, yy, mm
from have
group by cin, yy
having count(*) > 1;
quit;
Thanks @LinusH While my original goal was to identify the duplicate row values, your suggestion proved to be valuable in a broader sense. It allowed me to at least determine the count of rows with duplicate values.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Check out this tutorial series to learn how to build your own steps in SAS Studio.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.