SASPhile
Quartz | Level 8
I'm wondering whether there is a text mining technique that can be used to match address fields that share the same phyid.
For instance:
Phyid      adr1                       phyadr1
010011829  501 MED CENT DR            501 MEDICAL CENTER DR BOX 30114
010011829  501 MED CENTER DR STE:300  501 MEDICAL CENTER DR BOX 30114

adr1 and phyadr1 refer to the same address, but this is how the data is provided.
How can I match adr1 and phyadr1 in such cases?
1 REPLY 1
Daryl
SAS Employee
Does your site license the SAS Data Quality Server? This gives you procedures and data step functions that allow fuzzy matching for text data such as names and addresses.

If you don't have a license for the data quality tools, you would need a way to parse the addresses into components (house number, street name, box number, suite number, etc.), standardize these elements, and then order them to create appropriate groups that represent the same location.
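
The parse-and-standardize approach can be sketched roughly as follows. This is a minimal illustration in Python rather than SAS, and the same logic could be ported to DATA step code. The names `ABBREVIATIONS`, `UNIT_KEYWORDS`, `parse_address`, and `same_location` are hypothetical, and the abbreviation table is an assumption that covers only the two sample rows from the question; real data would need a much larger table.

```python
import re

# Hypothetical abbreviation table (assumption): maps variants to one standard form.
ABBREVIATIONS = {
    "MED": "MEDICAL",
    "CENT": "CENTER",
    "CTR": "CENTER",
    "DRIVE": "DR",
    "SUITE": "STE",
}

# Secondary-unit keywords whose following token is split off from the street part.
UNIT_KEYWORDS = {"STE", "BOX"}


def parse_address(raw):
    """Return (standardized street string, dict of secondary units)."""
    # Uppercase, turn punctuation such as ':' into spaces, then split into tokens.
    tokens = re.sub(r"[^A-Z0-9 ]", " ", raw.upper()).split()
    # Standardize each token through the abbreviation table.
    tokens = [ABBREVIATIONS.get(t, t) for t in tokens]

    street, units = [], {}
    i = 0
    while i < len(tokens):
        if tokens[i] in UNIT_KEYWORDS and i + 1 < len(tokens):
            # Keyword plus its value, e.g. STE 300 or BOX 30114.
            units[tokens[i]] = tokens[i + 1]
            i += 2
        else:
            street.append(tokens[i])
            i += 1
    return " ".join(street), units


def same_location(a, b):
    """Treat two addresses as a match when their standardized street parts agree."""
    return parse_address(a)[0] == parse_address(b)[0]
```

With this sketch, `same_location("501 MED CENTER DR STE:300", "501 MEDICAL CENTER DR BOX 30114")` compares the two street parts after the suite and box numbers are set aside. Whether differing suite or box numbers should still count as the same location is a design choice that depends on the data.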

About DQ Server:
http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a003359998.htm

About PROC DQMATCH:
http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a002629014.htm

