<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to match COMPGED scores to 100 or less in left join? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768700#M243845</link>
    <description>&lt;P&gt;Have you looked at how COMPGED is calculated?&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/lefunctionsref/p1r4l9jwgatggtn1ko81fyjys4s7.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/lefunctionsref/p1r4l9jwgatggtn1ko81fyjys4s7.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Inserting one character costs 100 points by default (200 if it is the first character). A score lower than 100 is for almost identical strings, which is clearly not the case for the values shown.&lt;/P&gt;</description>
    <pubDate>Tue, 21 Sep 2021 04:35:51 GMT</pubDate>
    <dc:creator>ChrisNZ</dc:creator>
    <dc:date>2021-09-21T04:35:51Z</dc:date>
    <item>
      <title>How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768587#M243792</link>
      <description>&lt;P&gt;Hi~&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To keep only matches with a COMPGED score of 100 or less in a left join, I used the COMPGED function as shown below.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;============================================================================&lt;BR /&gt;libname proj "D:\SASWORK";&lt;BR /&gt;proc sql; *Left join ccmmatch with s34match via cusip;&lt;BR /&gt;create table dualmatch&lt;BR /&gt;as select a.*, b.mgrname, b.fdate, b.shares, b.shrout2, compged (a.lender, b.mgrname, 'I') as match&lt;BR /&gt;from proj.ccmmatch a left join proj.s34match b&lt;BR /&gt;on (a.cusip=b.s34cusip1) and (compged (a.lender, b.mgrname) le 100) and (year(a.FacilityStartDate)=year(b.FDATE)) and (month(a.FacilityStartDate) between month(b.FDATE)-2 and month(b.FDATE));&lt;BR /&gt;quit;&lt;BR /&gt;============================================================================&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There are no errors in this code.&amp;nbsp;However, my COMPGED scores are much larger than 100, as shown below.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="match.PNG" style="width: 101px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/63743iB69E77993C92499B/image-size/large?v=v2&amp;amp;px=999" role="button" title="match.PNG" alt="match.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I would really appreciate any help.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 20 Sep 2021 15:33:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768587#M243792</guid>
      <dc:creator>Smile1</dc:creator>
      <dc:date>2021-09-20T15:33:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768594#M243794</link>
      <description>&lt;P&gt;Hello &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/397532"&gt;@Smile1&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;COMPGED costs are very quickly above 100.&lt;/P&gt;
&lt;P&gt;Who has told you that you need to have a&amp;nbsp;&lt;SPAN style="font-family: inherit;"&gt;generalized edit distance below 100?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="font-family: inherit;"&gt;You can adjust the COMPGED costs yourself with the CALL&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN style="font-family: inherit;"&gt;COMPCOST routine. You can also use modifiers in the COMPGED function to make the comparison case-insensitive and/or to ignore leading blanks and quotation marks.&lt;BR /&gt;&lt;BR /&gt;There's probably nothing wrong with your code, but if you are above 100, you are above 100 (and the join may find no matches).&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
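&lt;P&gt;A minimal sketch of the CALL COMPCOST idea, assuming it is called once at the start of a DATA step so that later COMPGED calls in that step use the adjusted cost. The data set name and the cost value of 50 are illustrative assumptions, not recommendations.&lt;/P&gt;
&lt;PRE&gt;data cost_demo;
   /* Assumption: charge 50 instead of the default 100 for a replacement */
   if _n_ = 1 then call compcost('replace', 50);
   x = 'word';
   y = 'wood';
   /* One replaced character, scored with the adjusted cost */
   z = compged(x, y);
run;&lt;/PRE&gt;
&lt;P&gt;This sketch does not show whether the adjusted costs carry over to a COMPGED call inside PROC SQL, so computing the score in a DATA step first may be the safer route.&lt;/P&gt;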
&lt;P&gt;&lt;SPAN style="font-family: inherit;"&gt;Kind regards,&lt;BR /&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Sep 2021 16:00:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768594#M243794</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-09-20T16:00:25Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768660#M243819</link>
      <description>&lt;P&gt;You might be interested in this example:&lt;/P&gt;
&lt;PRE&gt;data example;
   x='word';
   y='wood';
   z=compged(x,y);
run;&lt;/PRE&gt;
&lt;P&gt;Comparing two 4-letter words that differ by a single letter gives a COMPGED value of 100. It takes very little to exceed 100 with Compged scores.&lt;/P&gt;
&lt;P&gt;In the little I use this function, I generally look for scores below 800, and only for people's names. Your variable name Lender suggests a company name, with all sorts of potential for punctuation such as periods, commas, or semicolons and special characters like &amp;amp; (A&amp;amp;B Bank or such). I would seldom expect a "lender" and an "mgrname", which I would guess to be "manager's name" or similar, to be very close by COMPGED score.&lt;/P&gt;
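&lt;P&gt;A few more pairs can be scored the same way. This is only a sketch: the word pairs and the data set name are made up for illustration, and STRIP is used so that padding from the fixed-length variables does not enter the comparison.&lt;/P&gt;
&lt;PRE&gt;data more_examples;
   length x y $ 10;
   input x $ y $;
   /* Score each pair with the default COMPGED costs */
   z = compged(strip(x), strip(y));
   datalines;
word wood
word words
word sword
;
run;

proc print data=more_examples;
run;&lt;/PRE&gt;
&lt;P&gt;The printed scores should show that a replacement, an appended character, and a character inserted at the start are each charged differently.&lt;/P&gt;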
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Perhaps you need SPEDIS, which generates smaller values because it uses a different algorithm. Otherwise, accept a larger Compged threshold.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Sep 2021 21:03:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768660#M243819</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2021-09-20T21:03:23Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768700#M243845</link>
      <description>&lt;P&gt;Have you looked at how COMPGED is calculated?&lt;/P&gt;
&lt;P&gt;&lt;A href="https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/lefunctionsref/p1r4l9jwgatggtn1ko81fyjys4s7.htm" target="_blank"&gt;https://documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/lefunctionsref/p1r4l9jwgatggtn1ko81fyjys4s7.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;Inserting one character costs 100 points by default (200 if it is the first character). A score lower than 100 is for almost identical strings, which is clearly not the case for the values shown.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Sep 2021 04:35:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/768700#M243845</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2021-09-21T04:35:51Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770160#M244306</link>
      <description>Hi Ballardw,&lt;BR /&gt;&lt;BR /&gt;Thank you for your response. Yes, it takes very little to exceed 100 with Compged scores.&lt;BR /&gt;But I think there may be a way to standardize COMPGED costs to a 0 - 100 scale.&lt;BR /&gt;There is no doubt that we could easily recognize which name is closest to the original if COMPGED costs were measured on a 0 - 100 scale rather than from 0 to infinity.&lt;BR /&gt;&lt;BR /&gt;Maybe I need to learn how to use SPEDIS if I can't find a way to standardize COMPGED costs to a 0 - 100 scale.&lt;BR /&gt;Thank you for your help!</description>
      <pubDate>Fri, 24 Sep 2021 08:46:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770160#M244306</guid>
      <dc:creator>Smile1</dc:creator>
      <dc:date>2021-09-24T08:46:18Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770166#M244309</link>
      <description>Hi ChrisNZ,&lt;BR /&gt;&lt;BR /&gt;Thank you for your response. Yes, a score lower than 100 is for almost identical strings.&lt;BR /&gt;But I think there may be a way to standardize COMPGED costs to a 0 - 100 scale.&lt;BR /&gt;There is no doubt that we could easily recognize which name is closest to the original if COMPGED costs were measured on a 0 - 100 scale rather than from 0 to infinity.&lt;BR /&gt;&lt;BR /&gt;I will try different ways as well. Thank you for your help!</description>
      <pubDate>Fri, 24 Sep 2021 09:16:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770166#M244309</guid>
      <dc:creator>Smile1</dc:creator>
      <dc:date>2021-09-24T09:16:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to match COMPGED scores to 100 or less in left join?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770167#M244310</link>
      <description>Hi Koen,&lt;BR /&gt;&lt;BR /&gt;Thank you for your response. Yes, COMPGED costs are very quickly above 100.&lt;BR /&gt;But I think there may be a way to standardize COMPGED costs to a 0 - 100 scale.&lt;BR /&gt;There is no doubt that we could easily recognize which name is closest to the original if COMPGED costs were measured on a 0 - 100 scale rather than from 0 to infinity.&lt;BR /&gt;&lt;BR /&gt;I will try different ways as well. Thank you for your help!</description>
      <pubDate>Fri, 24 Sep 2021 09:17:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-match-COMPGED-scores-to-100-or-less-in-left-join/m-p/770167#M244310</guid>
      <dc:creator>Smile1</dc:creator>
      <dc:date>2021-09-24T09:17:36Z</dc:date>
    </item>
  </channel>
</rss>

