DATA Step, Macro, Functions and more

duplicates

Reply
New Contributor
Posts: 4

duplicates

Hi,

 

we won't to flag records that have either duplicate names or names that are close (ie missing the JR or SR or MS vs MRS).

 

we always want to do this for addresses.

 

Thank You,

 

Louise

Super User
Posts: 19,772

Re: duplicates

Posted in reply to lu2kaseff

Duplicates are easy, close is not. The problem with close is that you literally need to compare each address to all other addresses to determine closenes. 

 

Cleaning addresses is not fun, but look at some papers on LexJansen.com for some methods to standardize the data.

Super User
Posts: 5,424

Re: duplicates

Posted in reply to lu2kaseff

If the task is critical, and you need to do this a lot, you might want to take look at SAS Data Management Studio.

Data never sleeps
Ask a Question
Discussion stats
  • 2 replies
  • 126 views
  • 0 likes
  • 3 in conversation