I have a data set with 100k records of customer information.
I'm planning to build some simple rules for matching (ex. firstname + lastname);
So the code must go like matching record 1 to the other records then if it matches then i put the ids of these records in an output table.
Greatly appreciate your advise what are the possible faster approaches to attain these matching of records across a single table.
Thanks!
It sounds like you're looking to do data linkages based on identifiers. Here's a tool that has been referenced - though I've never used it.
Statistics Canada offers a tool called G-Link as well, free but they recommend support, you can find it via google.
Additionally, here's a solution that I kind of like that uses a few of the fuzzy matching options.
https://communities.sas.com/t5/SAS-Procedures/Name-matching/td-p/82780
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.