Hi,
we won't to flag records that have either duplicate names or names that are close (ie missing the JR or SR or MS vs MRS).
we always want to do this for addresses.
Thank You,
Louise
Duplicates are easy, close is not. The problem with close is that you literally need to compare each address to all other addresses to determine closenes.
Cleaning addresses is not fun, but look at some papers on LexJansen.com for some methods to standardize the data.
If the task is critical, and you need to do this a lot, you might want to take look at SAS Data Management Studio.
It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.