Hi Patrick, I am using SAS Dataflux. I have to agree that Dataflux is indeed a great software. However, as you mentioned, Dataflux collapses similar strings into a single matchcode which I then can use it to join my table. Not sure you are referring to Match Codes under the Entity Resolution node. If yes, then I have a question. As you will find in my attachment, the final output (All Matches and All_Non_Match) files have lesser total number of firms than my original input files (Text file input 1 and Text file input 2). In other words, if the total number of firms in original input files (Text file input 1 and Text file input 2) consist of 1000 firms, then logically, the final output (All Matches and All_Non_Match) files should also consist of 1000 right? In my case, I lost about 148 firms and upon checking manually, I realize that companies such ABC 1995, ABC 1996, ABC 1997 (which may refer to the similar or not similar companies) have been merged into one, which is ABC. Can I customize the Match Codes function so that ABC 1995, ABC 1996, ABC 1997 are considered as different companies? My sincere advance thank you. Warm regards, Steven
... View more