<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Eliminate duplicates if a condition is satisfied in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145516#M38707</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Well simplest I can think of right now is:&lt;/P&gt;&lt;P&gt;data Test; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; input identifier $ order condition; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; datalines;&lt;BR /&gt;1023 1 0&lt;BR /&gt;1023 2 0&lt;BR /&gt;1098 1 0&lt;BR /&gt;1098 1 1&lt;BR /&gt;;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;proc sql undo_policy=none;&lt;BR /&gt;&amp;nbsp; delete from TEST A&lt;BR /&gt;&amp;nbsp; where not exists(select distinct THIS.IDENTIFIER from TEST THIS where THIS.IDENTIFIER=A.IDENTIFIER and THIS.CONDITION=1)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; and ORDER ne (select max(THIS.ORDER) from TEST THIS where THIS.IDENTIFIER=A.IDENTIFIER);&lt;BR /&gt;quit;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 28 Oct 2014 15:26:35 GMT</pubDate>
    <dc:creator>RW9</dc:creator>
    <dc:date>2014-10-28T15:26:35Z</dc:date>
    <item>
      <title>Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145513#M38704</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;I want to eliminate duplicates from a database, based on an identifier, an order and a condition.&lt;/P&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;More precisely, if a condition is met (I can create a variable that equals 1 if it is and 0 otherwise), I would like to select one unique observation per identifier, based by its order (the last one). If this condition is not met then I want to keep all the observations related to this identifier.&lt;/P&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;Without the condition I can do that (or use a proc sql but that is not point)&lt;/P&gt;&lt;PRE style="margin: 0 0 10px; padding: 5px; font-size: 14px; font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; color: #000000; background: #eeeeee;"&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt;proc sort data=have; &lt;/CODE&gt;&lt;/P&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt; by identifier descending order; &lt;/CODE&gt;&lt;/P&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt;run; &lt;/CODE&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt;proc sort nudopkey data=have; &lt;/CODE&gt;&lt;/P&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt; by identifier; &lt;/CODE&gt;&lt;/P&gt;
&lt;P&gt;&lt;CODE style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; background-position: initial;"&gt;run; &lt;/CODE&gt;&lt;/P&gt;

&lt;/PRE&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;But how to incorporate my condition in this ?&lt;/P&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;&lt;/P&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;For instance, with this dataset &lt;/P&gt;&lt;P style="margin: 0 0 1em; font-size: 14px; color: #000000; font-family: Arial, 'Liberation Sans', 'DejaVu Sans', sans-serif; background: #ffffff;"&gt;&lt;/P&gt;&lt;P&gt;data Test; &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; input identifier $ order condition; &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; datalines;&lt;/P&gt;&lt;P&gt;1023 1 0&lt;/P&gt;&lt;P&gt;1023 2 0&lt;/P&gt;&lt;P&gt;1098 1 0&lt;/P&gt;&lt;P&gt;1098 1 1&lt;/P&gt;&lt;P&gt;;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I would like to keep the lines :&lt;/P&gt;&lt;P&gt;-1023 2 0&lt;/P&gt;&lt;P&gt;-1098 1 0&lt;/P&gt;&lt;P&gt;-1098 1 1&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Oct 2014 13:58:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145513#M38704</guid>
      <dc:creator>Aboiron</dc:creator>
      <dc:date>2014-10-28T13:58:45Z</dc:date>
    </item>
    <item>
      <title>Re: Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145514#M38705</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Do you mean you have a flag 1 or 0 in your data?&amp;nbsp; If so then you could just add a where flag=1.&amp;nbsp; You will probably have to do two steps, one for those with the condition true -&amp;gt; find max(), and those without.&lt;/P&gt;&lt;P&gt;proc sql;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; create table WANT as&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; select&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; *&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; from&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; HAVE&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; where&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; *condition is false*&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; union all&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; select&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; *&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; from&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; HAVE&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; where&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; *condition is true*&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; having&amp;nbsp;&amp;nbsp;&amp;nbsp; XYZ=(select max(XYZ) ...);&lt;/P&gt;&lt;P&gt;quit;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You could also do it in datastep, sort by condition max() value, then if first row is true the set output flag to 1 else 0.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Some test data might help.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Oct 2014 14:17:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145514#M38705</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2014-10-28T14:17:46Z</dc:date>
    </item>
    <item>
      <title>Re: Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145515#M38706</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;added a minimal example.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If I use a where statement in my proc nodupkey, it removes my observations I want to keep&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Oct 2014 14:40:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145515#M38706</guid>
      <dc:creator>Aboiron</dc:creator>
      <dc:date>2014-10-28T14:40:25Z</dc:date>
    </item>
    <item>
      <title>Re: Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145516#M38707</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Well simplest I can think of right now is:&lt;/P&gt;&lt;P&gt;data Test; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; input identifier $ order condition; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; datalines;&lt;BR /&gt;1023 1 0&lt;BR /&gt;1023 2 0&lt;BR /&gt;1098 1 0&lt;BR /&gt;1098 1 1&lt;BR /&gt;;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;proc sql undo_policy=none;&lt;BR /&gt;&amp;nbsp; delete from TEST A&lt;BR /&gt;&amp;nbsp; where not exists(select distinct THIS.IDENTIFIER from TEST THIS where THIS.IDENTIFIER=A.IDENTIFIER and THIS.CONDITION=1)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; and ORDER ne (select max(THIS.ORDER) from TEST THIS where THIS.IDENTIFIER=A.IDENTIFIER);&lt;BR /&gt;quit;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Oct 2014 15:26:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145516#M38707</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2014-10-28T15:26:35Z</dc:date>
    </item>
    <item>
      <title>Re: Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145517#M38708</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;PRE&gt;data Test; 
&amp;nbsp;&amp;nbsp; input identifier $ order condition; 
&amp;nbsp;&amp;nbsp; datalines;
1023 1 0
1023 2 0
1098 1 0
1098 1 1
;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 
run;
data want(drop=found);
 do until(last.identifier);
&amp;nbsp; set test;
&amp;nbsp; by identifier;
&amp;nbsp; if&amp;nbsp; condition then found=1;
 end; 
 do until(last.identifier);
&amp;nbsp; set test;
&amp;nbsp; by identifier;
&amp;nbsp; if not found then do; if last.identifier then output;end;
&amp;nbsp;&amp;nbsp; else output;
 end;
run;

&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Xia Keshan&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Oct 2014 12:26:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145517#M38708</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2014-10-29T12:26:26Z</dc:date>
    </item>
    <item>
      <title>Re: Eliminate duplicates if a condition is satisfied</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145518#M38709</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;data have (index=(pkey=(identifier&amp;nbsp; condition order))); &lt;BR /&gt;&amp;nbsp;&amp;nbsp; input identifier $ order condition; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; datalines;&lt;BR /&gt;2093 1 0&lt;BR /&gt;2093 2 0&lt;BR /&gt;1098 1 0&lt;BR /&gt;1098 1 1&lt;BR /&gt;1065 2 0&lt;BR /&gt;1065 1 0&lt;BR /&gt;1065 1 3&lt;BR /&gt;;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;data want;&lt;BR /&gt;set have;&lt;BR /&gt;by identifier condition;&lt;BR /&gt;if last.condition;&lt;BR /&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Oct 2014 12:39:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Eliminate-duplicates-if-a-condition-is-satisfied/m-p/145518#M38709</guid>
      <dc:creator>Loko</dc:creator>
      <dc:date>2014-10-29T12:39:14Z</dc:date>
    </item>
  </channel>
</rss>

