I have a data set with 100k records of customer information.
I'm planning to build some simple rules for matching (ex. firstname + lastname);
So the code must go like matching record 1 to the other records then if it matches then i put the ids of these records in an output table.
Greatly appreciate your advise what are the possible faster approaches to attain these matching of records across a single table.
Thanks!
It sounds like you're looking to do data linkages based on identifiers. Here's a tool that has been referenced - though I've never used it.
Statistics Canada offers a tool called G-Link as well, free but they recommend support, you can find it via google.
Additionally, here's a solution that I kind of like that uses a few of the fuzzy matching options.
https://communities.sas.com/t5/SAS-Procedures/Name-matching/td-p/82780
It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.