huks
Fluorite | Level 6

Hi guys, I need to do name matching.

When there are differences at the end of the name, e.g., Martin/Martino, I can find an identical match by reducing the sensitivity

huks_0-1756295306745.png

huks_1-1756295387957.png

but the problem is when there are differences at the beginning, e.g., Fikir/Sikir.

huks_2-1756295562092.png

huks_3-1756295656195.png

 

How should I configure it to get an identical match code, given that reducing the sensitivity always relaxes the match from the end of the name toward the beginning? There is just one letter of difference between the two names and I can't find the match!

 

Thanks for your help

 

Hugues

7 REPLIES
SASKiwi
PROC Star

I'm pretty sure that letter differences at the beginning of a name are much more important than at the end. For example, Bell and Dell, or Ball and Wall, are clearly different surnames, so they shouldn't have the same match codes.

huks
Fluorite | Level 6
Hi, maybe, if you only check this one column. But if you add matching on name, address, ... with different sensitivities and rules, like 50% on surname + 100% on address, ... I have to determine the exact matches or near-matches.
audrey
SAS Employee

You might want to customize the QKB definition to fit your needs if the existing one doesn't.

Take a look at this video for a quick introduction
https://www.youtube.com/watch?v=TdvKDVdBrdk

Hope this helps,

Audrey

www.sas.com | support.sas.com
SAS®... THE POWER TO KNOW®
huks
Fluorite | Level 6
Thanks, but I know how to customize a definition. What I don't know is how to make it do what I want.
audrey
SAS Employee

If your goal is to identify strings that have just 1 letter difference, maybe it's easier to use a distance function in an Expression node.

When you make changes to the matching definition, it can have a large impact. For your example, you could update the Phonetics library to say "F" and "S" are similar in sound, and that would fix this specific issue. But it would have a much bigger impact and could trigger lots of false positives.

I don't think there is an easy fix without knowing your data and requirements.
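
To make the distance idea concrete, here is a minimal sketch in plain Python rather than in an Expression node. It computes the Levenshtein edit distance, which is one common choice of distance function. The names `edit_distance`, `is_near_match` and the `max_distance` threshold are illustrative assumptions, not part of SAS or the QKB.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, or substitutions that turn a into b."""
    a, b = a.casefold(), b.casefold()  # compare case-insensitively
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def is_near_match(a: str, b: str, max_distance: int = 1) -> bool:
    """Hypothetical helper: treat two names as a near match when they
    differ by at most max_distance edits (the threshold is an assumption)."""
    return edit_distance(a, b) <= max_distance


if __name__ == "__main__":
    for left, right in [("Fikir", "Sikir"), ("Martin", "Martino"), ("Bell", "Dell")]:
        print(left, right, edit_distance(left, right))
```

A distance like this does not care where the differing letter sits, so Fikir/Sikir and Martin/Martino both come out one edit apart. Bell/Dell does too, which is why the threshold would need to be combined with other fields before calling something a match.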

Audrey

 

huks
Fluorite | Level 6

Thanks for your answer. I can do that indeed, but what I want to try is to build a match code that reads from left to right and not, as usual, from right to left. As I explained before, if I have Martin and Martina, or Allan and Allen, I can find a near match by playing with the sensitivity, but when the difference is in the first character I can't.
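
The asymmetry described here can be illustrated with a deliberately simplified toy model in Python. This is not how QKB match definitions actually build match codes; it only assumes, as stated in this thread, that lowering the sensitivity drops detail from the end of the name first. The functions `prefix_key` and `suffix_key` and the `keep` parameter are hypothetical.

```python
def prefix_key(name: str, keep: int) -> str:
    """Toy key: keep only the first `keep` characters, so differences
    at the end of the name are ignored as `keep` gets smaller."""
    if keep <= 0:
        raise ValueError("keep must be positive")
    return name.upper()[:keep]


def suffix_key(name: str, keep: int) -> str:
    """Toy key built from the other direction: keep only the last
    `keep` characters, so differences at the start are ignored."""
    if keep <= 0:
        raise ValueError("keep must be positive")
    return name.upper()[-keep:]


if __name__ == "__main__":
    # End-of-name difference: the prefix key agrees once it is short enough.
    print(prefix_key("Martin", 5), prefix_key("Martino", 5))
    # Start-of-name difference: no prefix length makes these agree.
    print(prefix_key("Fikir", 4), prefix_key("Sikir", 4))
    # A key built from the end agrees for Fikir/Sikir ...
    print(suffix_key("Fikir", 4), suffix_key("Sikir", 4))
    # ... but it also merges clearly different surnames.
    print(suffix_key("Bell", 3), suffix_key("Dell", 3))
```

In this toy model a key built from the end of the name does bring Fikir and Sikir together, but it also gives Bell and Dell the same key, which is the false-positive risk raised earlier in the thread.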

Patrick
Opal | Level 21

Like @SASKiwi already mentioned, a difference in the first letter is normally relevant, and the match codes need to be different. If you have cases where they should be the same, then you need to learn how to amend/extend the out-of-the-box QKB to cover your special cases. This should become a BAU task if you are using SAS DQ in earnest.
