SASPhile
Quartz | Level 8
I'm wondering if there is any text mining technique that can be used to match the address fields based on phyid.
For instance:
Phyid      adr1                       phyadr1
010011829  501 MED CENT DR            501 MEDICAL CENTER DR BOX 30114
010011829  501 MED CENTER DR STE:300  501 MEDICAL CENTER DR BOX 30114

adr1 and phyadr1 refer to the same address, but this is how the data is provided.
How can I match adr1 and phyadr1 in such cases?
Daryl
SAS Employee
Does your site license the SAS Data Quality Server? This gives you procedures and data step functions that allow fuzzy matching for text data such as names and addresses.

If you don't have a license for the data quality tools, you would need a way to parse the addresses into components (house number, street name, box number, suite number, etc.), standardize these elements, and then order them to create appropriate groups that represent the same location.
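As a rough illustration of the parse-and-standardize approach described above, here is a minimal sketch. It is written in Python only to keep it short; the same logic could be ported to a DATA step with functions such as COMPRESS, SCAN and TRANWRD. The abbreviation table, the unit keywords and the function names are all hypothetical and cover only the sample rows from the question. A real address standardizer would need a much larger dictionary and more careful parsing.

```python
import re

# Hypothetical abbreviation table covering only the sample data.
# Note that "DR" is assumed to mean DRIVE here; in other data it could mean DOCTOR.
ABBREVIATIONS = {
    "MED": "MEDICAL",
    "CENT": "CENTER",
    "CTR": "CENTER",
    "DR": "DRIVE",
    "STE": "SUITE",
}

# Keywords assumed to introduce a secondary unit followed by its identifier.
UNIT_KEYWORDS = {"BOX", "SUITE"}


def parse_address(raw):
    """Split a raw address into a standardized street string and a dict of units."""
    # Upper-case and turn punctuation such as ':' into spaces before tokenizing.
    tokens = re.sub(r"[^A-Z0-9]+", " ", raw.upper()).split()
    # Expand known abbreviations; unknown tokens are kept unchanged.
    tokens = [ABBREVIATIONS.get(t, t) for t in tokens]

    street, units = [], {}
    i = 0
    while i < len(tokens):
        if tokens[i] in UNIT_KEYWORDS and i + 1 < len(tokens):
            # Pull "BOX 30114" or "SUITE 300" out of the street portion.
            units[tokens[i]] = tokens[i + 1]
            i += 2
        else:
            street.append(tokens[i])
            i += 1
    return " ".join(street), units


def same_street(a, b):
    """True when two raw addresses standardize to the same street portion."""
    return parse_address(a)[0] == parse_address(b)[0]


rows = [
    ("010011829", "501 MED CENT DR", "501 MEDICAL CENTER DR BOX 30114"),
    ("010011829", "501 MED CENTER DR STE:300", "501 MEDICAL CENTER DR BOX 30114"),
]
for phyid, adr1, phyadr1 in rows:
    print(phyid, same_street(adr1, phyadr1))
```

Comparing only the street portion treats the suite and box numbers as optional detail. Whether that is acceptable depends on how strict the match needs to be; the extracted units are returned separately so they can be compared as well if required.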

About DQ Server:
http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a003359998.htm

About PROC DQMATCH:
http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a002629014.htm
