BookmarkSubscribeRSS Feed
gwcanalytics
Calcite | Level 5

Hello,

I am hoping that someone out there has already solved this and can share some handy code.

I have two separate datasets that contain client information listing Name, Address, City, State and Zip code.  data1 is supplied by an external vendor while data2 is our internal records so they are not entered exactly the same way.  I need some ideas on best way to use SAS to process both files and find matches for same customer between the two files even if the name or address information is not entered as identical between the 2 files.  I use base SAS v9.4 in Windows environment.

Thanks, in advance, for any insight!

2 REPLIES 2
Reeza
Super User

There isn't an exact way, but you look at some fuzzy matching options. Some functions to look into are:

COMPGED

COMPLEV

SOUNDS LIKE

SOUNDEX

SPEDIS

There's a post I like on here that goes through several iterations to find a match, by FriedEgg.

gwcanalytics
Calcite | Level 5

Hello Reeza,

Thank you very much for sharing the post from FriedEgg.  It was exactly what I needed!

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1225 views
  • 4 likes
  • 2 in conversation