SAS Data Integration Studio, DataFlux Data Management Studio, SAS/ACCESS, SAS Data Loader for Hadoop and others

Software Generates Different Matching Codes For the Same Field Values

Reply
Contributor
Posts: 23

Software Generates Different Matching Codes For the Same Field Values

In the first table I have defined field 'TBL_1_Door_Number' as TEXT, Sensitivity=85.

In the second table I have defined field "TBL_2_Door_Number" the same way as in the first table.

 

After that I am generating matching codes based on those fields and joining the result in one table.

Although the definition is same these two fields from different tables generate DIFFERENT MATCHING CODES for SAME values.

For example, the text value '5' in the first table generates the matching code 'H$$$$$$$$$$$$$$' and the same text value in the second table generates the matching code '5$$$$$$$$$$$$$$'.

 

Can anyone give me an explanation why the software generates different matching codes for the same values?

Super User
Posts: 7,809

Re: Software Generates Different Matching Codes For the Same Field Values

Show code and log, please. And some test data (in the form of data steps with cards) to illustrate and repeat your problem in our local environments.

---------------------------------------------------------------------------------------------
Maxims of Maximally Efficient SAS Programmers
Contributor
Posts: 23

Re: Software Generates Different Matching Codes For the Same Field Values

[ Edited ]
Posted in reply to KurtBremser

you will find in the attachment the data jobs and data sample (extracted from MS SQL Server).

 

(since I am completely new to SAS tool, I would also appreciate any other advice you can give me, for example regarding the data jobs I made in order to merge the addresses: is it the correct way to do it etc.)

 

loking forward to your reply.

 

UPDATE:
additionaly, i do not understand how software for the field value '5' generates matching code '5$$$$$$$$$$$$$$', but for field values '8' and '2' generates matching codes 'D$$$$$$$$$$$$$$' and 'H$$$$$$$$$$$$$$' respectively. Shouldn't the software for falues '8' and '2' generate '8$$$$$$$$$$$$$$' and '2$$$$$$$$$$$$$$' respectively?

I got the same results by changign sensitivity from 85 -> 95%.

 

Attachment
Ask a Question
Discussion stats
  • 2 replies
  • 263 views
  • 0 likes
  • 2 in conversation