☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
Fluorite | Level 6

Hi guys, I need to do name matching.

When the names differ at the end, e.g., Martin/Martino, I can get an identical match code by reducing the sensitivity.

But the problem is when they differ at the beginning, e.g., Fikir/Sikir.

How should I configure it to get an identical match code, given that lowering the sensitivity always relaxes the comparison from the end of the name toward the beginning? The two names differ by just one letter and I can't find the match!

Thanks for your help

Hugues

1 ACCEPTED SOLUTION

Accepted Solutions
PROC Star

Maybe a combination of an address match and surname spelling difference might be a simpler option than tinkering with the QKB in that case. I've combined techniques like this myself and they can be quite useful.

9 REPLIES
PROC Star

I'm pretty sure that letter differences at the beginning of a name are much more important than those at the end. For example, Bell and Dell, or Ball and Wall, are clearly different surnames, so they shouldn't have the same match codes.

Fluorite | Level 6
Hi, maybe, if you only check this column. But once you add matching on name, address, etc., with different sensitivities and rules such as 50% on surname + 100% on address, I have to determine the exact matches or near-matches.
PROC Star

Maybe a combination of an address match and surname spelling difference might be a simpler option than tinkering with the QKB in that case. I've combined techniques like this myself and they can be quite useful.
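
To make the idea concrete outside of SAS, here is a minimal Python sketch of that kind of combined rule: require the address to match exactly after light normalization, and only then tolerate a small spelling difference in the surname. Everything in it is an assumption for illustration: the record layout, the sample addresses, the helper names and the 0.75 threshold are made up, and it uses Python's standard `difflib` rather than anything from the QKB.

```python
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(text.lower().split())


def surname_similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two surnames (1.0 means identical)."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def is_candidate_match(rec_a: dict, rec_b: dict, min_surname_similarity: float = 0.75) -> bool:
    """Hypothetical rule: exact address match AND a close-enough surname."""
    same_address = normalize(rec_a["address"]) == normalize(rec_b["address"])
    close_surname = surname_similarity(rec_a["surname"], rec_b["surname"]) >= min_surname_similarity
    return same_address and close_surname


# Made-up sample records, for illustration only.
fikir = {"surname": "Fikir", "address": "12 Main Street"}
sikir = {"surname": "Sikir", "address": "12  main street"}
bell = {"surname": "Bell", "address": "3 Oak Avenue"}
dell = {"surname": "Dell", "address": "48 Elm Road"}

print(is_candidate_match(fikir, sikir))  # same address, one-letter surname difference
print(is_candidate_match(bell, dell))    # similar surnames, but different addresses
```

Because the address has to agree first, pairs like Bell/Dell at different addresses are not flagged even though their surnames are just as close as Fikir/Sikir.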

Fluorite | Level 6

Yes, I already do that with first name and last name, but I think I'll change the sensitivity level. Thanks.

SAS Employee

You might want to customize the QKB definition to fit your needs if the existing one doesn't.

Take a look at this video for a quick introduction:
https://www.youtube.com/watch?v=TdvKDVdBrdk

Hope this helps,

Audrey

www.sas.com | support.sas.com
SAS®... THE POWER TO KNOW®
Fluorite | Level 6
Thanks. I know how to customize a definition, but I don't know how to make it do what I want.
SAS Employee

If your goal is to identify strings that differ by just one letter, it may be easier to use a distance function in an Expression node.

Changes to the match definition can have a large impact. For your example, you could update the Phonetics library to say that "F" and "S" sound similar, and that would fix this specific issue. But it would have a much wider effect and could trigger lots of false positives.

I don't think there is an easy fix without knowing your data and requirements.
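
As a rough illustration of what such a distance function computes, here is a small Python sketch of Levenshtein (edit) distance. It is not the Expression node syntax, and the helper names and the `max_distance` default are assumptions for the example; the name pairs come from this thread.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions or substitutions needed to turn a into b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


def is_near_match(a: str, b: str, max_distance: int = 1) -> bool:
    """Hypothetical helper: case-insensitive comparison within max_distance edits."""
    return levenshtein(a.strip().lower(), b.strip().lower()) <= max_distance


pairs = [("Fikir", "Sikir"), ("Martin", "Martino"), ("Allan", "Allen"), ("Bell", "Dell")]
for left, right in pairs:
    print(left, right, levenshtein(left.lower(), right.lower()), is_near_match(left, right))
```

All four pairs are one edit apart, so a plain distance test treats Fikir/Sikir and Bell/Dell the same way. It catches differences anywhere in the name, but it still needs other fields or rules to keep false positives down.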

Audrey

Fluorite | Level 6

Thanks for your answer. I can indeed do that, but what I want to try is to build a match code that reads from left to right and not, as usual, from right to left. As I explained before, if I have Martin and Martina, or Allan and Allen, I can find a near match by playing with the sensitivity, but when the difference is in the first character I can't.
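
The asymmetry described here can be shown with a deliberately simplified toy in Python. This is not how SAS builds match codes (real match definitions involve parsing, standardization and phonetics); it only assumes, for illustration, that lower sensitivity behaves like keeping fewer leading characters. The function names and `keep` values are hypothetical.

```python
def prefix_key(name: str, keep: int) -> str:
    """Toy key that keeps the first `keep` characters, so differences at the end are ignored."""
    if keep < 1:
        raise ValueError("keep must be at least 1")
    return name.strip().lower()[:keep]


def suffix_key(name: str, keep: int) -> str:
    """Toy key that keeps the last `keep` characters, so differences at the start are ignored."""
    if keep < 1:
        raise ValueError("keep must be at least 1")
    return name.strip().lower()[-keep:]


# Difference at the end: the prefix key agrees once the tail is dropped.
print(prefix_key("Martin", 5) == prefix_key("Martina", 5))

# Difference at the start: no prefix length can hide it, but the suffix key can.
print(prefix_key("Fikir", 4) == prefix_key("Sikir", 4))
print(suffix_key("Fikir", 4) == suffix_key("Sikir", 4))
```

The flip side is that a key built from the end of the name would also put Bell and Dell, or Ball and Wall, in the same group.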

Opal | Level 21

Like @SASKiwi already mentioned, a difference in the first letter is normally relevant, and the match codes need to be different. If you have cases where they should be the same, then you need to learn how to amend/extend the out-of-the-box QKB to cover your special cases. This should become a business-as-usual (BAU) task if you are using SAS DQ in earnest.
