<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Help deduplicate after probabilistic linkage in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573324#M161844</link>
    <description>&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Very correct in my typological error when posting. ID_A 123 would not inherit the match score from ID_A 127 and ID_B 777.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But your solution is what I am looking for!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You saved my A$$.&lt;/P&gt;</description>
    <pubDate>Sat, 13 Jul 2019 22:07:00 GMT</pubDate>
    <dc:creator>Schilker</dc:creator>
    <dc:date>2019-07-13T22:07:00Z</dc:date>
    <item>
      <title>Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573311#M161834</link>
      <description>&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data one;
  input ID_A ID_B Match_score;
cards;
123 777 28.1
124 778 15.6 
125 787 19.7
125 799 18.9
126 762 36.1
127 762 55.1
127 777 28.7
128 999 19.5
129 781 18.2
;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;I just performed probabilistic linkage on two datasets. The output dataset called "one", contains the identification number from both original datasets,&amp;nbsp; ID_A, the other ID_B, with a linkage score "match_score". There are numerous combinations of ID_A and ID_B. I want to select only the top linkage to pair then remove them from the selection process for further linkages. An ideal output would be...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ID_A&amp;nbsp; &amp;nbsp; &amp;nbsp;ID_B&amp;nbsp; &amp;nbsp; &amp;nbsp;Match_score&amp;nbsp;&lt;/P&gt;&lt;P&gt;127&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;762&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;55.1&lt;/P&gt;&lt;P&gt;123&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;777&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;28.1&lt;/P&gt;&lt;P&gt;125&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;787&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;19.7&lt;/P&gt;&lt;P&gt;128&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;999&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;19.5&lt;/P&gt;&lt;P&gt;129&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;781&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;18.2&lt;/P&gt;&lt;P&gt;124&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;778&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;15.6&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;ID_A: 126 wouldn't match because of the ID_B (762),&amp;nbsp; match_score is higher for another ID_A (127).&lt;/P&gt;&lt;P&gt;ID_B: 799 wouldn't match because ID_A(125) had a larger match_score with (787)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any help would be greatly appreciated!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Edited 28.7 to 28.1...thanks for pointing out&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 22:05:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573311#M161834</guid>
      <dc:creator>Schilker</dc:creator>
      <dc:date>2019-07-13T22:05:08Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573315#M161837</link>
      <description>&lt;P&gt;I vaguely remember the TOP notch people in the forum asking if you have &lt;STRONG&gt;SAS OR&lt;/STRONG&gt; that has the procs for networking problems that solves at ease. I have seen PG, Ksharp , Robpratt answering such questions frequently.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I had &lt;STRONG&gt;fun&lt;/STRONG&gt; attempting in this thread a while ago-&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Dec,17,2018 attempt&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/SAS-Programming/SAS-unique-identifier-for-multiple-columns-with-same-values/td-p/519970/page/2" target="_blank" rel="noopener"&gt;https://communities.sas.com/t5/SAS-Programming/SAS-unique-identifier-for-multiple-columns-with-same-values/td-p/519970/page/2&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So I tried c&lt;STRONG&gt;opy pasting and plugging&amp;nbsp; your dataset with a tweak&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;

data have;
  input (Cust dev) (:$8.) Match_score;
cards;
123 777 28.1
124 778 15.6 
125 787 19.7
125 799 18.9
126 762 36.1
127 762 55.1
127 777 28.7
128 999 19.5
129 781 18.2
;
proc sort data=have;
by cust dev;
run;
dm log 'clear';

data _null_;
if _n_=1 then do;
  if 0 then set  have have(rename=(dev=_dev cust=_cust));
  length __dev UID __cust $5;
	/*H initial full load*/
  dcl hash H (dataset:'have(rename=(dev=_dev cust=_cust))',multidata:'y') ;
   h.definekey  ("_dev") ;
   h.definedata ("_cust", "_dev","match_score") ;
   h.definedone () ;
   dcl hiter hh('h');
   	/*H1 Look up and load part by part load proceeding to our need*/
   dcl hash H1 (multidata:'y',ordered:'y') ;
   h1.definekey  ("__cust") ;
   h1.definedata ("__cust", "__dev","match_score","UID") ;
   h1.definedone () ;
   dcl hiter hh1('h1');
   call missing(__dev,__cust);
end;
array t(999) $5;
array j(999) $5;
do until(last.cust);
	set have end=l;
	by cust;
	if first.cust then 
	do;
		if  h1.find(key:cust)= 0 then f=1;
		if not f then do; c+1; uid= cats( 'UID', put(c,8. -l));end;
	end;
	if not f then 
	do;
		/*Look from dev to cust and collect cust residuals in array*/
			do rc1=h.find(key:dev) by 0 while(rc1=0);
				rc=h1.add(key:_cust,data:_cust,data:_dev,data:match_score,data:uid);
		/*collect cust residuals in array that's not part of cust but part of device*/
				if _cust ne cust then do;n+1;t(n)=_cust;end;
				rc1=h.find_next();
			end;
	end;
end;
n=0;
if not f then 
do;
	/*Residual look up-start with cust and iterate with dev*cust recursively until all check complete*/
	do until(sum(cmiss(of t(*)),cmiss(of j(*)))=dim(t)*2);
		do i=1 to dim(t);
			if not missing(t(i)) then 
	/*iterate the full load h using hh*/
				do while(hh.next()=0);
					if t(i)=_cust then
					do;
		/*Check if residual cust's* dev is not in part load hash,if not then fetch*/
						k=.;
						do while(hh1.next()=0);
						if __dev = _dev then do; k=1;leave;end;
						end;
						if not  k then do;n1+1;j(n1)=_dev;end;
						rc=h1.add(key:_cust,data:t(i),data:_dev,data:match_score,data:uid);
					end;	
				end;
		end;
		call missing(of t(*));n1=0;
		do i=1 to dim(j);
			if not missing(j(i)) then 
				do rc1=h.find(key:j(i)) by 0 while(rc1=0);
			/*Check if residual dev's *cust is not in part load hash,if not then fetch*/
					if h1.check(key:_cust) ne 0 then do;n+1;t(n)=_cust;end;
					rc=h1.add(key:_cust,data:_cust,data:_dev,data:match_score,data:uid);
					rc1=h.find_next();
				end;
		end;
		call missing(of j(*));n=0;
	end;
end;
if l then h1.output(dataset:'want');
run;

proc sql;
create table final_want as
select __cust as ID_A,__dev as ID_B, Match_score
from want 
group by uid
having match_score=max(match_score);
quit;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;STRONG&gt;FINAL_WANT&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;TABLE width="156"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="36"&gt;&lt;STRONG&gt;ID_A&lt;/STRONG&gt;&lt;/TD&gt;
&lt;TD width="35"&gt;&lt;STRONG&gt;ID_B&lt;/STRONG&gt;&lt;/TD&gt;
&lt;TD width="85"&gt;&lt;STRONG&gt;Match_score&lt;/STRONG&gt;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;127&lt;/TD&gt;
&lt;TD&gt;762&lt;/TD&gt;
&lt;TD&gt;55.1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;124&lt;/TD&gt;
&lt;TD&gt;778&lt;/TD&gt;
&lt;TD&gt;15.6&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;125&lt;/TD&gt;
&lt;TD&gt;787&lt;/TD&gt;
&lt;TD&gt;19.7&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;128&lt;/TD&gt;
&lt;TD&gt;999&lt;/TD&gt;
&lt;TD&gt;19.5&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;129&lt;/TD&gt;
&lt;TD&gt;781&lt;/TD&gt;
&lt;TD&gt;18.2&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 20:46:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573315#M161837</guid>
      <dc:creator>novinosrin</dc:creator>
      <dc:date>2019-07-13T20:46:29Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573318#M161838</link>
      <description>&lt;P&gt;This is great except it excludes the match ID_A (123) and ID_B(777). Because ID_B (777) wasn't the top match for ID_A (127), it should be used to match with the next best possible ID_A, which would be ID_A (123).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 21:29:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573318#M161838</guid>
      <dc:creator>Schilker</dc:creator>
      <dc:date>2019-07-13T21:29:00Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573321#M161841</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/115825"&gt;@Schilker&lt;/a&gt;&amp;nbsp; &amp;nbsp;I think my understanding is perhaps wrong and consequently your req may not be easily plugged to the code&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So from this&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;TABLE width="220"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="36"&gt;ID_A&lt;/TD&gt;
&lt;TD width="35"&gt;ID_B&lt;/TD&gt;
&lt;TD width="85"&gt;Match_score&lt;/TD&gt;
&lt;TD width="64"&gt;Group&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;123&lt;/TD&gt;
&lt;TD&gt;777&lt;/TD&gt;
&lt;TD&gt;28.1&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;124&lt;/TD&gt;
&lt;TD&gt;778&lt;/TD&gt;
&lt;TD&gt;15.6&lt;/TD&gt;
&lt;TD&gt;UID2&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;125&lt;/TD&gt;
&lt;TD&gt;787&lt;/TD&gt;
&lt;TD&gt;19.7&lt;/TD&gt;
&lt;TD&gt;UID3&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;125&lt;/TD&gt;
&lt;TD&gt;799&lt;/TD&gt;
&lt;TD&gt;18.9&lt;/TD&gt;
&lt;TD&gt;UID3&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;126&lt;/TD&gt;
&lt;TD&gt;762&lt;/TD&gt;
&lt;TD&gt;36.1&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;126&lt;/TD&gt;
&lt;TD&gt;762&lt;/TD&gt;
&lt;TD&gt;36.1&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;127&lt;/TD&gt;
&lt;TD&gt;777&lt;/TD&gt;
&lt;TD&gt;28.7&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;127&lt;/TD&gt;
&lt;TD&gt;762&lt;/TD&gt;
&lt;TD&gt;28.7&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;127&lt;/TD&gt;
&lt;TD&gt;777&lt;/TD&gt;
&lt;TD&gt;28.1&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;127&lt;/TD&gt;
&lt;TD&gt;762&lt;/TD&gt;
&lt;TD&gt;55.1&lt;/TD&gt;
&lt;TD&gt;UID1&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;128&lt;/TD&gt;
&lt;TD&gt;999&lt;/TD&gt;
&lt;TD&gt;19.5&lt;/TD&gt;
&lt;TD&gt;UID4&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;129&lt;/TD&gt;
&lt;TD&gt;781&lt;/TD&gt;
&lt;TD&gt;18.2&lt;/TD&gt;
&lt;TD&gt;UID5&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can see , i took the max of the match_score for each By group. Therefore your "&lt;EM&gt;Because ID_B (777) wasn't the top match for ID_A (127), it should be used to match with the next best possible ID_A, which would be ID_A (123)."&amp;nbsp;&lt;/EM&gt;is not clear to me or the grouping logic is not supposed to the be approach to follow&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 21:39:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573321#M161841</guid>
      <dc:creator>novinosrin</dc:creator>
      <dc:date>2019-07-13T21:39:38Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573322#M161842</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/115825"&gt;@Schilker&lt;/a&gt;&amp;nbsp;and welcome to the SAS Support Communities!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Isn't this just a matter of two lookup tables to check if either component of a matched pair has occurred before (i.e. with a higher match score), after sorting the pairs by descending match score?&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc sort data=one;
by descending match_score ID_A ID_B;
run;

data want;
if _n_=1 then do;
  dcl hash ha();
  ha.definekey('ID_A');
  ha.definedone();
  dcl hash hb();
  hb.definekey('ID_B');
  hb.definedone();
end;
set one;
if ha.check()*hb.check() then do;
  output;
  ha.add();
  hb.add();
end;
run;

proc print data=want noobs;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;Result:&lt;/P&gt;
&lt;PRE&gt;                Match_
ID_A    ID_B     score

 127     762     55.1
 123     777     28.&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;1&lt;/FONT&gt;&lt;/STRONG&gt;
 125     787     19.7
 128     999     19.5
 129     781     18.2
 124     778     15.6&lt;/PRE&gt;
&lt;P&gt;However, if your sample output is correct, then mine is wrong (see highlighted decimal above). But why should the pair (12&lt;STRONG&gt;3&lt;/STRONG&gt;, 777) "inherit" match score 28.&lt;FONT color="#FF0000"&gt;7&lt;/FONT&gt;, which was computed for the pair (12&lt;STRONG&gt;7&lt;/STRONG&gt;, 777)?&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 21:48:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573322#M161842</guid>
      <dc:creator>FreelanceReinh</dc:creator>
      <dc:date>2019-07-13T21:48:31Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573324#M161844</link>
      <description>&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Very correct in my typological error when posting. ID_A 123 would not inherit the match score from ID_A 127 and ID_B 777.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But your solution is what I am looking for!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You saved my A$$.&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 22:07:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573324#M161844</guid>
      <dc:creator>Schilker</dc:creator>
      <dc:date>2019-07-13T22:07:00Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573329#M161847</link>
      <description>&lt;P&gt;Great, but one remaining issue might be &lt;EM&gt;tied&lt;/EM&gt; match scores. In this case the sort order would not be uniquely determined by the scores. (As you see, I used ID_A and ID_B, in ascending order, as potential tie breakers.) But the order &lt;EM&gt;does&lt;/EM&gt; have an impact on which of the subsequent pairs are discarded.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Example:&lt;/P&gt;
&lt;PRE&gt;data one;
input ID_A ID_B Match_score;
cards;
&lt;STRONG&gt;&lt;FONT color="#008000"&gt;101 201 100&lt;/FONT&gt;&lt;/STRONG&gt;
101 202 100
102 201  90
&lt;FONT color="#008000"&gt;&lt;STRONG&gt;103 202  80&lt;/STRONG&gt;&lt;/FONT&gt;
103 203  75
;&lt;/PRE&gt;
&lt;P&gt;With the current algorithm only the two observations highlighted in green would be selected. However, the only reason why the second obs. (&lt;FONT face="courier new,courier"&gt;101 202 100&lt;/FONT&gt;) is discarded is its higher ID_B, which is rather arbitrary, isn't it? But if the second obs. &lt;EM&gt;had&lt;/EM&gt; been selected (and the first discarded instead), the third and fifth obs. would have been selected as well rather than the fourth. So, the result would have been different both in terms of match scores (80 vs. 90 and 75) and number of selected observations! Would this be a problem?&lt;/P&gt;</description>
      <pubDate>Sat, 13 Jul 2019 22:58:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573329#M161847</guid>
      <dc:creator>FreelanceReinh</dc:creator>
      <dc:date>2019-07-13T22:58:04Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573363#M161867</link>
      <description>&lt;P&gt;Why you pick 123 not 127 ? since 28.7 &amp;gt; 28.1&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE class=" language-sas"&gt;&lt;CODE class="  language-sas"&gt;&lt;SPAN class="token datalines"&gt;&lt;SPAN class="token data string"&gt;123 777 28.1
...........
127 777 28.7&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 14 Jul 2019 11:45:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573363#M161867</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-07-14T11:45:15Z</dc:date>
    </item>
    <item>
      <title>Re: Help deduplicate after probabilistic linkage</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573365#M161868</link>
      <description>Because 127 had a better match&lt;BR /&gt;</description>
      <pubDate>Sun, 14 Jul 2019 12:29:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Help-deduplicate-after-probabilistic-linkage/m-p/573365#M161868</guid>
      <dc:creator>Schilker</dc:creator>
      <dc:date>2019-07-14T12:29:19Z</dc:date>
    </item>
  </channel>
</rss>

