<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Matching Company Names in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526810#M282452</link>
    <description>&lt;P&gt;Hi cmoore,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for sending the code, and I apologise for the late update. I had no access to my computer during the weekend, hence the late reply. I am now going to try the code you sent.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, the names in the two datasets are not all the same; some differ. What I sent was to aid your understanding of what my dataset looks like. I think fuzzy matching will be the best way forward, as you mentioned. Could you help with that as well? Thanks in advance for the help. Regards, EJAA.&lt;/P&gt;</description>
    <pubDate>Sun, 13 Jan 2019 23:45:14 GMT</pubDate>
    <dc:creator>EJAA</dc:creator>
    <dc:date>2019-01-13T23:45:14Z</dc:date>
    <item>
      <title>Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439674#M282440</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have two data sources of information that I want to merge together. The common matchkey across both sources is the Client Name. However, there are different variations of spellings for the same Client. Please see below a small example of the two datasets I have and what I would like it to be transformed to;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Have1&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Have2&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Client_Name&lt;/TD&gt;&lt;TD&gt;Sector&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Client_Name&lt;/TD&gt;&lt;TD&gt;Sub_Sector&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3i Group Plc&lt;/TD&gt;&lt;TD&gt;Global Financial Investors&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;3i&lt;/TD&gt;&lt;TD&gt;Private Equity&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3m Company&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;3M Company (NYSE:MMM)&lt;/TD&gt;&lt;TD&gt;Manufacturing industries&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Ag Barr Plc&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;A.P. 
Møller - Maersk A/S (CPSE:MAERSK B)&lt;/TD&gt;&lt;TD&gt;Transport&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Maersk&lt;/TD&gt;&lt;TD&gt;Energy, Resources and Infrastructure&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AB Volvo (publ) (OM:VOLV B)&lt;/TD&gt;&lt;TD&gt;Automotive&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aa Plc&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Abertis&lt;/TD&gt;&lt;TD&gt;Infrastructure funds&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Electrolux&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;BT&lt;/TD&gt;&lt;TD&gt;Telecoms&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Volvo&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;3M Holding Company&lt;/TD&gt;&lt;TD&gt;Manufacturing industries&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abb Ltd&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AB Electrolux (publ) (OM:ELUX B)&lt;/TD&gt;&lt;TD&gt;Consumer&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbott Laboratories&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;RBS&lt;/TD&gt;&lt;TD&gt;Banks&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbvie Inc&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;ABB&lt;/TD&gt;&lt;TD&gt;Manufacturing industries&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aberdeen Asset Management Plc&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AAM PLC (LSE:ADN)&lt;/TD&gt;&lt;TD&gt;Asset managers&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aberforth Smaller Companies&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AbbVie Incorp&lt;/TD&gt;&lt;TD&gt;Health&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abertis Infraestructuras Sa&lt;/TD&gt;&lt;TD&gt;Global Financial 
Investors&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Abbott Laboratories (NYSE:ABT)&lt;/TD&gt;&lt;TD&gt;Health&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abn Amro Group Nv&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Abertis Infraestructuras, S.A. (BME:ABE)&lt;/TD&gt;&lt;TD&gt;Infrastructure funds&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Britsh Telecommunications&lt;/TD&gt;&lt;TD&gt;Technology, Media and Telecoms&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AG BAR&lt;/TD&gt;&lt;TD&gt;Consumer&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Royal Bank of Scotland PLC&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;AA&lt;/TD&gt;&lt;TD&gt;Business Services&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;The Coca Cola Company&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;Coca cola&lt;/TD&gt;&lt;TD&gt;Consumer&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And the final output should look like;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Want&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Client Name&lt;/TD&gt;&lt;TD&gt;Sector&lt;/TD&gt;&lt;TD&gt;Sub_Sector&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3i Group Plc&lt;/TD&gt;&lt;TD&gt;Global Financial Investors&lt;/TD&gt;&lt;TD&gt;Private Equity&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3m Company&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;Manufacturing industries&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Ag Barr Plc&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;Consumer&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Maersk&lt;/TD&gt;&lt;TD&gt;Energy, Resources and 
Infrastructure&lt;/TD&gt;&lt;TD&gt;Transport&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aa Plc&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;Business Services&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Electrolux&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;Consumer&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Volvo&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;"Unknown"&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abb Ltd&lt;/TD&gt;&lt;TD&gt;Industrials&lt;/TD&gt;&lt;TD&gt;Manufacturing industries&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbott Laboratories&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;Health&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbvie Inc&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;Health&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aberdeen Asset Management Plc&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;Asset managers&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Aberforth Smaller Companies&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;Infrastructure funds&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abertis Infraestructuras Sa&lt;/TD&gt;&lt;TD&gt;Global Financial Investors&lt;/TD&gt;&lt;TD&gt;Infrastructure funds&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abn Amro Group Nv&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;"Unknown"&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Britsh Telecommunications&lt;/TD&gt;&lt;TD&gt;Technology, Media and Telecoms&lt;/TD&gt;&lt;TD&gt;Telecoms&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Royal Bank of Scotland PLC&lt;/TD&gt;&lt;TD&gt;Financial Institutions Group&lt;/TD&gt;&lt;TD&gt;Banks&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;The Coca Cola Company&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Consumer&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there a reliable way for me to fuzzy match across the 
different client names? Any help would be greatly appreciated.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Chris&lt;/P&gt;</description>
      <pubDate>Fri, 23 Feb 2018 12:45:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439674#M282440</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2018-02-23T12:45:03Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439684#M282441</link>
      <description>&lt;P&gt;How reliable do you need to be?&lt;/P&gt;
&lt;P&gt;The most managed way to do this is to use Data Management Studio.&lt;/P&gt;</description>
      <pubDate>Fri, 23 Feb 2018 13:19:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439684#M282441</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2018-02-23T13:19:33Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439700#M282442</link>
      <description>&lt;P&gt;It doesn't have to be perfect to be honest. 70% plus maybe. Just need a good way to fuzzy match the example data set then take it from there.&lt;/P&gt;</description>
      <pubDate>Fri, 23 Feb 2018 14:07:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439700#M282442</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2018-02-23T14:07:27Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439748#M282443</link>
      <description>70%? What kind of analysis can you do with that little accuracy?&lt;BR /&gt;Given your examples (and my limited knowledge of fuzzy logic), I think you can apply some basic rules, like matching on the uppercased first two words (normalized), or create an extensive mapping table (or a combination of both).&lt;BR /&gt;Or don't reinvent the wheel and pay for some capable software.</description>
      <pubDate>Fri, 23 Feb 2018 16:10:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439748#M282443</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2018-02-23T16:10:24Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439823#M282444</link>
      <description>&lt;P&gt;This is a non-trivial problem because comparing every name in one table against every name in the other takes N*M calculations, which can be a very large number.&lt;/P&gt;
&lt;P&gt;Some tips:&lt;/P&gt;
&lt;P&gt;1. Upcase everything&lt;/P&gt;
&lt;P&gt;2. Replace or remove &amp;amp; and other symbols such as @ in the text&lt;/P&gt;
&lt;P&gt;3. Ensure Co and/or LTD are standardized, e.g. LTD vs LTD. vs Limited&lt;/P&gt;
&lt;P&gt;4. Do an exact match and remove those records&lt;/P&gt;
&lt;P&gt;5. Remove all spaces and try a second round of exact matches&lt;/P&gt;
&lt;P&gt;6. Do fuzzy matching using COMPGED/COMPLEV/SOUNDS LIKE&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
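&lt;P&gt;A minimal sketch of tips 1 to 4, assuming two small tables shaped like the Have1/Have2 example in the question. The sample rows, the dataset names (have1, have2, clean1, clean2, exact, unmatched1) and the key variable are hypothetical, and the suffix rules are only examples. This is one possible reading of the tips, not code posted in the thread.&lt;/P&gt;
&lt;PRE&gt;
```sas
/* Hypothetical sample tables; replace with your own data. */
data have1;
  infile datalines dsd truncover;
  length client_name $60 sector $40;
  input client_name sector;
  datalines;
3m Company,Industrials
Abb Ltd,Industrials
The Coca Cola Company,Consumer and Healthcare Group
;
run;

data have2;
  infile datalines dsd truncover;
  length client_name $60 sub_sector $40;
  input client_name sub_sector;
  datalines;
3M Company (NYSE:MMM),Manufacturing industries
ABB Limited,Manufacturing industries
Coca cola,Consumer
;
run;

/* Tips 1 to 3: build a normalized key (same logic for both tables). */
data clean1;
  set have1;
  length key $60;
  key = upcase(scan(client_name, 1, '('));         /* drop a trailing ticker such as (NYSE:MMM) */
  key = compress(key, ' ', 'kad');                 /* keep letters, digits and spaces only */
  key = prxchange('s/\bLIMITED\b/LTD/', -1, key);  /* example suffix rules, extend as needed */
  key = prxchange('s/\bCOMPANY\b/CO/', -1, key);
  key = compbl(strip(key));
run;

data clean2;
  set have2;
  length key $60;
  key = upcase(scan(client_name, 1, '('));
  key = compress(key, ' ', 'kad');
  key = prxchange('s/\bLIMITED\b/LTD/', -1, key);
  key = prxchange('s/\bCOMPANY\b/CO/', -1, key);
  key = compbl(strip(key));
run;

proc sql;
  /* Tip 4: exact match on the normalized key. */
  create table exact as
  select a.client_name, a.sector, b.sub_sector
  from clean1 as a inner join clean2 as b
    on a.key = b.key;

  /* Rows with no exact match are left for the later rounds. */
  create table unmatched1 as
  select * from clean1
  where key not in (select key from clean2);
quit;
```
&lt;/PRE&gt;
&lt;P&gt;For tip 5, the same join could be repeated on compress(key) for the rows in unmatched1 before any fuzzy scoring is attempted.&lt;/P&gt;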
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 23 Feb 2018 20:04:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/439823#M282444</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-02-23T20:04:21Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/440390#M282445</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13879"&gt;@Reeza&lt;/a&gt;&amp;nbsp;I would add one step to reduce the size of the cartesian product&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;6. Do fuzzy matching using COMPGED/COMPLEV/SOUNDS LIKE/SPEDIS and equality of first letter&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;7. Do fuzzy matching using COMPGED/COMPLEV/SOUNDS LIKE&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;COMPGED works best in my experience&lt;/SPAN&gt;&lt;/P&gt;
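&lt;P&gt;A rough sketch of the extra step, assuming the names have already been cleaned and the exact matches removed. The tables names1 and names2, their rows, and the cutoff of 300 are hypothetical and would need tuning on real data.&lt;/P&gt;
&lt;PRE&gt;
```sas
/* Hypothetical cleaned names left over after the exact-match rounds. */
data names1;
  infile datalines truncover;
  input name $40.;
  datalines;
ABBVIE INC
BRITSH TELECOMMUNICATIONS
;
run;

data names2;
  infile datalines truncover;
  input name $40.;
  datalines;
ABBVIE INCORP
ABBOTT LABORATORIES
BRITISH TELECOMMUNICATIONS
;
run;

proc sql;
  create table fuzzy as
  select a.name as name1,
         b.name as name2,
         compged(strip(a.name), strip(b.name)) as ged
  from names1 as a, names2 as b
  /* Only pairs that share a first letter are kept as candidates. */
  where substr(a.name, 1, 1) = substr(b.name, 1, 1)
    and compged(strip(a.name), strip(b.name)) le 300  /* assumed cutoff */
  group by a.name
  /* Keep the lowest-cost candidate(s) for each name in names1. */
  having calculated ged = min(compged(strip(a.name), strip(b.name)));
quit;
```
&lt;/PRE&gt;
&lt;P&gt;Names that find no candidate here can then go through a second pass without the first-letter condition, as in step 7.&lt;/P&gt;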
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 27 Feb 2018 05:02:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/440390#M282445</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-02-27T05:02:21Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/440838#M282446</link>
      <description>&lt;P&gt;Thanks for the information. I have a list of headline titles containing company names, and a company lookup table. Please see below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Have 1&lt;/TD&gt;&lt;TD&gt;Have 2&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Headline_Title&lt;/TD&gt;&lt;TD&gt;Client_Name&lt;/TD&gt;&lt;TD&gt;FF_Sector&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3I in takeover talks&lt;/TD&gt;&lt;TD&gt;3i Group Plc&lt;/TD&gt;&lt;TD&gt;Global Financial Investors&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;AP Moller Maersk to invest in AI technology&lt;/TD&gt;&lt;TD&gt;Maersk&lt;/TD&gt;&lt;TD&gt;Energy, Resources and Infrastructure&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;to form partnership with EE BT Group&lt;/TD&gt;&lt;TD&gt;BT Group PLC&lt;/TD&gt;&lt;TD&gt;Power, Utilities &amp;amp; Renewables&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;in fines due to miss selling EDF&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;EDF&lt;/TD&gt;&lt;TD&gt;Power, Utilities &amp;amp; Renewables&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbott labs to sell off Africa arm&lt;/TD&gt;&lt;TD&gt;Abbot Laboratories&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Bank RBS post profit&lt;/TD&gt;&lt;TD&gt;RBS&lt;/TD&gt;&lt;TD&gt;Banks&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I want to extract the company names from the text and match them to a lookup table of cleansed company names that I have. 
I want the table to look like this;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;Want&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Headline_Title&lt;/TD&gt;&lt;TD&gt;Client_Name&lt;/TD&gt;&lt;TD&gt;FF_Sector&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;3I in takeover talks&lt;/TD&gt;&lt;TD&gt;3i Group Plc&lt;/TD&gt;&lt;TD&gt;Global Financial Investors&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;AP Moller Maersk to invest in AI technology&lt;/TD&gt;&lt;TD&gt;Maersk&lt;/TD&gt;&lt;TD&gt;Energy, Resources and Infrastructure&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;to form partnership with EE BT Group&lt;/TD&gt;&lt;TD&gt;BT Group PLC&lt;/TD&gt;&lt;TD&gt;Power, Utilities &amp;amp; Renewables&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;in fines due to miss selling EDF&amp;nbsp;&lt;/TD&gt;&lt;TD&gt;EDF&lt;/TD&gt;&lt;TD&gt;Power, Utilities &amp;amp; Renewables&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Abbott labs to sell off Africa arm&lt;/TD&gt;&lt;TD&gt;Abbot Laboratories&lt;/TD&gt;&lt;TD&gt;Consumer and Healthcare Group&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;Bank RBS post profit&lt;/TD&gt;&lt;TD&gt;RBS&lt;/TD&gt;&lt;TD&gt;Banks&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have thousands of records to match to and this is just an example.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Would you happen to know what the best way is to do this?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;BR /&gt;&lt;BR /&gt;Chris&lt;/P&gt;</description>
      <pubDate>Wed, 28 Feb 2018 11:13:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/440838#M282446</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2018-02-28T11:13:56Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/441027#M282447</link>
      <description>&lt;P&gt;Answers were already given. What is unclear in all the explanations above?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Feb 2018 21:19:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/441027#M282447</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-02-28T21:19:39Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525986#M282448</link>
      <description>&lt;P&gt;Hello cmoore,&lt;/P&gt;&lt;P&gt;I have a problem similar to the one you posted, and I want to ask whether you could share your code with me so that I can modify it to suit my work. I am trying to match director names from two different databases. Could you please help me out with the code that helped you solve your problem? Thanks very much.&lt;/P&gt;</description>
      <pubDate>Thu, 10 Jan 2019 07:36:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525986#M282448</guid>
      <dc:creator>EJAA</dc:creator>
      <dc:date>2019-01-10T07:36:11Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525990#M282449</link>
      <description>&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, I'll help you, no problem. Could you please provide me with 10 examples of the data that you are trying to match?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;BR /&gt;&lt;BR /&gt;Chris&lt;/P&gt;</description>
      <pubDate>Thu, 10 Jan 2019 08:07:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525990#M282449</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2019-01-10T08:07:37Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525995#M282450</link>
      <description>&lt;P&gt;Hello cmoore, please find below details of my tables; this is just a small portion of them. I want to merge the two tables using the director names, which run into thousands of rows. Thanks in advance for the assistance. Regards, Emmanuel.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;TABLE 1&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;exec_fname&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;exec_fullname&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;exec_lname&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;William&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;William P. Sullivan&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Sullivan&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Michael&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Michael R. McMullen&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;McMullen&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Didier&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Didier Hirsch&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Hirsch&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Ronald&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Ronald S. Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Adrian&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Adrian T. Dillon&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Dillon&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Nicolas&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Nicolas H. Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Ronald&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Ronald S. Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Nicolas&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Nicolas H. 
Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Michael&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Michael R. McMullen&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;McMullen&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Table 2&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;DIRECTORNAME&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Degree&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;William P. Sullivan&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MBA&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Michael R. McMullen&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;BSC&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Didier Hirsch&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;BSC&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Ronald S. Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;PHD&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Adrian T. Dillon&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MSC&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Nicolas H. Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MA&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Ronald S. Nersesian&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MBA&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Nicolas H. Roelofs,&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MBA&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Michael R. McMullen&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;MBA&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;</description>
      <pubDate>Thu, 10 Jan 2019 08:48:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/525995#M282450</guid>
      <dc:creator>EJAA</dc:creator>
      <dc:date>2019-01-10T08:48:00Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526011#M282451</link>
      <description>&lt;P&gt;Are all the names spelled the same way across the tables? Here is some rough code that cleanses the name on both tables and then joins on it. If the spellings weren't the same, you would need to use fuzzy matching techniques.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;data s1;&lt;/P&gt;&lt;P&gt;set s1;&lt;/P&gt;&lt;P&gt;format Cleaned_exec_fullname $255.;&lt;/P&gt;&lt;P&gt;/*Name*/&lt;BR /&gt;Cleaned_exec_fullname = scan(exec_fullname, 1,'()');&lt;BR /&gt;Cleaned_exec_fullname = upcase(Cleaned_exec_fullname);&lt;BR /&gt;Cleaned_exec_fullname = compress(Cleaned_exec_fullname,",,.,/,*,',","");&lt;BR /&gt;Cleaned_exec_fullname = compress(Cleaned_exec_fullname,"""");&lt;/P&gt;&lt;P&gt;Cleaned_exec_fullname = strip(Cleaned_exec_fullname);&lt;BR /&gt;Cleaned_exec_fullname = compbl(Cleaned_exec_fullname);&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;data s2;&lt;/P&gt;&lt;P&gt;set s2;&lt;/P&gt;&lt;P&gt;format Cleaned_directorname $255.;&lt;/P&gt;&lt;P&gt;/* Name*/&lt;BR /&gt;Cleaned_directorname = scan(directorname, 1,'()');&lt;BR /&gt;Cleaned_directorname = upcase(Cleaned_directorname);&lt;BR /&gt;Cleaned_directorname = compress(Cleaned_directorname,",,.,/,*,',","");&lt;BR /&gt;Cleaned_directorname = compress(Cleaned_directorname,"""");&lt;/P&gt;&lt;P&gt;Cleaned_directorname = strip(Cleaned_directorname);&lt;BR /&gt;Cleaned_directorname = compbl(Cleaned_directorname);&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;proc sql noprint;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;**Selecting exact matches;&lt;BR /&gt;create table Exact_matches_Comp as&lt;BR /&gt;select s1.exec_fullname as Original_Exec_Name,&lt;BR /&gt;s1.Cleaned_exec_fullname as Cleaned_Exec_Name,&lt;BR /&gt;s2.directorname as Original_Director_Name,&lt;BR /&gt;s2.Cleaned_directorname as Cleaned_Director_Name,&lt;BR /&gt;s2.degree&lt;/P&gt;&lt;P&gt;from s1 inner join s2&lt;BR /&gt;on s1.Cleaned_exec_fullname = s2.Cleaned_directorname&lt;BR /&gt;where s1.Cleaned_exec_fullname is 
not null;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;proc sort data=Exact_matches_Comp nodupkey; by Cleaned_Director_Name degree; run;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Jan 2019 10:43:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526011#M282451</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2019-01-10T10:43:47Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526810#M282452</link>
      <description>&lt;P&gt;Hi cmoore,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for sending the code, and I apologise for the late update. I had no access to my computer during the weekend, hence the late reply. I am now going to try the code you sent.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, the names in the two datasets are not all the same; some differ. What I sent was to aid your understanding of what my dataset looks like. I think fuzzy matching will be the best way forward, as you mentioned. Could you help with that as well? Thanks in advance for the help. Regards, EJAA.&lt;/P&gt;</description>
      <pubDate>Sun, 13 Jan 2019 23:45:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526810#M282452</guid>
      <dc:creator>EJAA</dc:creator>
      <dc:date>2019-01-13T23:45:14Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526820#M282453</link>
      <description>&lt;P&gt;&lt;EM&gt;&amp;gt; the names in the two datasets are not all the same; some differ.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Why do you supply identical names as examples then? You make it hard to help you.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 02:46:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526820#M282453</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2019-01-14T02:46:04Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526841#M282454</link>
      <description>&lt;P&gt;Thanks ChrisNZ.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Advice well taken for future correspondence.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Kindly find attached a sample of my dataset, and thanks in advance for the assistance. Regards, EJAA.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 03:22:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526841#M282454</guid>
      <dc:creator>EJAA</dc:creator>
      <dc:date>2019-01-14T03:22:01Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526847#M282455</link>
      <description>&lt;P&gt;There is no commonality between your two files.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How is this supposed to help us help you?&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 04:04:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526847#M282455</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2019-01-14T04:04:58Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526849#M282456</link>
      <description>&lt;P&gt;&lt;STRONG&gt;More to the point, have you tried fuzzy matching using the recommended steps above?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV id="lineardisplaymessageviewwrapper_4" class="lia-linear-display-message-view"&gt;
&lt;DIV class=""&gt;
&lt;DIV id="messageview_4" class="lia-message-view message-uid-439823" data-lia-message-uid="439823"&gt;
&lt;DIV class=""&gt;
&lt;DIV class="lia-js-resize-images lia-component-forums-widget-board-message-view"&gt;
&lt;DIV data-message-id="439823"&gt;
&lt;DIV class="lia-message-board lia-panel-message lia-js-data-messageUid-439823"&gt;
&lt;DIV class="lia-panel-message-content"&gt;
&lt;DIV class="lia-decoration-border"&gt;
&lt;DIV class="lia-decoration-border-content"&gt;
&lt;DIV class="lia-quilt lia-quilt-forum-message lia-quilt-layout-forum-message"&gt;
&lt;DIV class="lia-quilt-row lia-quilt-row-forum-message-main"&gt;
&lt;DIV class="lia-quilt-column lia-quilt-column-20 lia-quilt-column-right lia-quilt-column-main-right"&gt;
&lt;DIV class="lia-quilt-column-alley lia-quilt-column-alley-right"&gt;
&lt;DIV id="messagebodydisplay_0_4" class="lia-message-body lia-component-body"&gt;
&lt;DIV class="lia-message-body-content"&gt;
&lt;P&gt;&lt;EM&gt;Some tips:&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;1. Upcase everything&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;2. Change all &amp;amp; or other symbols such as&amp;nbsp;@ from text&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;3. Ensure Co and/or LTD are standardized, i.e. LTD vs LTD. vs Limited vs Limited =&amp;gt; This could be replaced to deal with middle names and/or initials, in the case of names&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;4. Do an exact match and remove those records&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;5. Remove all spaces and try a second round of exact matches&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;6. Do fuzzy matching using COMPGED/COMPLEV/SOUNDS LIKE/SPEDIS and equality of first letter&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;7. Do fuzzy matching using COMPGED/COMPLEV/SOUNDS LIKE&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;COMPGED works best in my experience&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;as well as&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/136670"&gt;@cmoore&lt;/a&gt;'s cleaning steps (extract, upcase, compress, justify, compl) to implement point 2 above?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = scan(directorname, 1,'()');&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = upcase(Cleaned_directorname);&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = compress(Cleaned_directorname,",,.,/,*,',","");&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = compress(Cleaned_directorname,"""");&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = strip(Cleaned_directorname);&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Cleaned_directorname = compbl(Cleaned_directorname);&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
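&lt;P&gt;A minimal sketch of how tips 4 to 6 could be chained, assuming the cleaning steps have already produced two tables, clean1 and clean2, each with a clean_name column. Those table and column names, the output table names, and the COMPGED cutoff of 200 are illustrative assumptions, not part of the original posts.&lt;/P&gt;

```sas
/* Sketch only: clean1, clean2, clean_name and the cutoff 200 are assumed. */
proc sql;
  /* Pass 1 (tip 4): exact matches on the cleaned name */
  create table pass1 as
  select a.clean_name as name1, b.clean_name as name2
  from clean1 as a inner join clean2 as b
    on a.clean_name = b.clean_name;

  /* Names from clean1 still unmatched after pass 1 */
  create table rest1 as
  select * from clean1
  where clean_name not in (select name1 from pass1);

  /* Pass 2 (tip 5): exact matches once all blanks are removed */
  create table pass2 as
  select a.clean_name as name1, b.clean_name as name2
  from rest1 as a inner join clean2 as b
    on compress(a.clean_name) = compress(b.clean_name);

  /* Names from clean1 still unmatched after pass 2 */
  create table rest2 as
  select * from rest1
  where clean_name not in (select name1 from pass2);

  /* Pass 3 (tip 6): fuzzy match restricted to the same first letter, */
  /* keeping only the lowest-cost candidate for each remaining name   */
  create table pass3 as
  select a.clean_name as name1, b.clean_name as name2,
         compged(a.clean_name, b.clean_name) as ged
  from rest2 as a inner join clean2 as b
    on substr(a.clean_name, 1, 1) = substr(b.clean_name, 1, 1)
   and compged(a.clean_name, b.clean_name) lt 200
  group by a.clean_name
  having calculated ged = min(compged(a.clean_name, b.clean_name));
quit;
```

&lt;P&gt;Keeping each pass in its own table records how every pair was matched, so the fuzzy results in pass3 can be reviewed separately from the exact ones. The log should show a remerging note for the last query, which is expected with this GROUP BY/HAVING pattern.&lt;/P&gt;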
&lt;P&gt;[Edited formatting]&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Mon, 14 Jan 2019 21:29:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526849#M282456</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2019-01-14T21:29:06Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526883#M282457</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The kind of code you could use to fuzzy match on name can be seen below. You just need to amend the variables to fit your data tables:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;/*Non-Exact Matches - Use the Algorithm to match variables - on Company Names*/&lt;/P&gt;&lt;P&gt;proc sql;&lt;/P&gt;&lt;P&gt;create table NonExact_matches_Comp as&lt;/P&gt;&lt;P&gt;select s1.company_name as Original_Company_Name,&lt;BR /&gt;s1.cleaned_company_name as Cleaned_Company_name,&lt;BR /&gt;s2.cleaned_company_name as connect_company_name,&lt;BR /&gt;s2.cleaned_group_name as connect_group_name,&lt;BR /&gt;s2.client_group_number,&lt;BR /&gt;compged(s1.cleaned_company_name,s2.cleaned_company_name,999,'LN') as compged,&lt;BR /&gt;spedis(s1.cleaned_company_name,s2.cleaned_company_name) as spedis&lt;BR /&gt;&lt;BR /&gt;from s1&lt;BR /&gt;inner join&lt;BR /&gt;s2&lt;BR /&gt;on (compged(s1.cleaned_company_name,s2.cleaned_company_name,999,'LN') &amp;lt; 200&lt;BR /&gt;and spedis(s1.cleaned_company_name,s2.cleaned_company_name) &amp;lt; 50)&lt;BR /&gt;/* group by the source name only, so that HAVING keeps the closest candidate for each name */&lt;BR /&gt;group by s1.company_name, s1.cleaned_company_name&lt;BR /&gt;having calculated compged = min(compged(s1.cleaned_company_name,s2.cleaned_company_name,999,'LN'))&lt;BR /&gt;;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 09:01:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/526883#M282457</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2019-01-14T09:01:39Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/527132#M282458</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/136670"&gt;@cmoore&lt;/a&gt;&amp;nbsp; This should be a multi-step process, going from best to worst match quality, as outlined above.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This ensures faster processing as well as better understanding of the quality of the results.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Jan 2019 21:31:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/527132#M282458</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2019-01-14T21:31:47Z</dc:date>
    </item>
    <item>
      <title>Re: Matching Company Names</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/527237#M282459</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/16961"&gt;@ChrisNZ&lt;/a&gt;&amp;nbsp;yes, I totally agree with your logic, and those are the steps I used to write my matching programme. The code above is just a snippet from the very end of the process, where I needed fuzzy logic on the poor data left after data cleansing and removing exact matches.&lt;/P&gt;</description>
      <pubDate>Tue, 15 Jan 2019 08:47:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Matching-Company-Names/m-p/527237#M282459</guid>
      <dc:creator>cmoore</dc:creator>
      <dc:date>2019-01-15T08:47:24Z</dc:date>
    </item>
  </channel>
</rss>

