Text mining and content categorization

Matching Name & Addresses

Reply
Occasional Contributor
Posts: 5

Matching Name & Addresses

Hello,

I am hoping that someone out there has already solved this and can share some handy code.

I have two separate datasets that contain client information listing Name, Address, City, State and Zip code.  data1 is supplied by an external vendor while data2 is our internal records so they are not entered exactly the same way.  I need some ideas on best way to use SAS to process both files and find matches for same customer between the two files even if the name or address information is not entered as identical between the 2 files.  I use base SAS v9.4 in Windows environment.

Thanks, in advance, for any insight!

Grand Advisor
Posts: 16,875

Re: Matching Name & Addresses

There isn't an exact way, but you look at some fuzzy matching options. Some functions to look into are:

COMPGED

COMPLEV

SOUNDS LIKE

SOUNDEX

SPEDIS

There's a post I like on here that goes through several iterations to find a match, by FriedEgg.

Occasional Contributor
Posts: 5

Re: Matching Name & Addresses

Hello Reeza,

Thank you very much for sharing the post from FriedEgg.  It was exactly what I needed!

Ask a Question
Discussion stats
  • 2 replies
  • 442 views
  • 4 likes
  • 2 in conversation