text mining for address match

Super Contributor
Posts: 673

I'm wondering if there is any text mining technique that can be used to match the address fields based on phyid.
For instance:
Phyid      adr1                       phyadr1
010011829  501 MED CENT DR            501 MEDICAL CENTER DR BOX 30114
010011829  501 MED CENTER DR STE:300  501 MEDICAL CENTER DR BOX 30114

adr1 and phyadr1 refer to the same address, but this is how the data is provided.
How can adr1 and phyadr1 be matched in such cases?
SAS Employee
Posts: 27

Re: text mining for address match

Does your site license the SAS Data Quality Server? It provides procedures and DATA step functions that allow fuzzy matching of text data such as names and addresses.

If you don't have a license for the data quality tools, you would need a way to parse the addresses into components (house number, street name, box number, suite number, etc.), standardize these elements, and then order them to create appropriate groups that represent the same location.
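The parse-and-standardize approach can be sketched roughly as follows. This is only an illustration in Python rather than SAS, and it is not how the Data Quality Server works internally. The abbreviation table, the `parse_address` and `same_location` helpers, and the choice to ignore suite and box numbers when comparing are all assumptions made for this example; a real abbreviation list would need to be much larger.

```python
import re

# Hypothetical abbreviation table (assumption); extend it for real data.
ABBREVIATIONS = {
    "MED": "MEDICAL",
    "CENT": "CENTER",
    "CTR": "CENTER",
    "DRIVE": "DR",
    "SUITE": "STE",
}

# Secondary-unit designators followed by their value, e.g. "STE 300", "BOX 30114".
UNIT_PATTERN = re.compile(r"\b(STE|BOX) (\w+)\b")


def parse_address(raw):
    """Standardize an address and split it into house number, street, and units."""
    # Uppercase and turn every run of punctuation into a space ("STE:300" -> "STE 300").
    tokens = re.sub(r"[^A-Z0-9]+", " ", raw.upper()).split()
    # Map each token to its standard form where one is known.
    text = " ".join(ABBREVIATIONS.get(t, t) for t in tokens)
    # Pull out suite/box components, then drop them from the street part.
    units = dict(UNIT_PATTERN.findall(text))
    core = UNIT_PATTERN.sub(" ", text).split()
    house = core[0] if core and core[0].isdigit() else ""
    street = " ".join(core[1:] if house else core)
    return {"house": house, "street": street, "units": units}


def same_location(a, b):
    """Treat two addresses as the same location if house number and street agree.

    Suite and box numbers are deliberately ignored here (an assumption).
    """
    pa, pb = parse_address(a), parse_address(b)
    return (pa["house"], pa["street"]) == (pb["house"], pb["street"])


print(same_location("501 MED CENT DR", "501 MEDICAL CENTER DR BOX 30114"))            # True
print(same_location("501 MED CENTER DR STE:300", "501 MEDICAL CENTER DR BOX 30114"))  # True
```

Comparing only house number and street groups both sample rows together; whether suite or box numbers should also have to agree depends on how strict the match needs to be.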

About DQ Server: http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a003359998.htm

About PROC DQMATCH:
http://support.sas.com/documentation/cdl/en/dqclref/63101/HTML/default/viewer.htm#a002629014.htm