<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to identify duplicates for data with IDs as a group of variables in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199653#M4393</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Just sort by all three vars:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc sort data=have;&lt;/P&gt;&lt;P&gt;&amp;nbsp; by var1 var2 var3;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data dups nodups;&lt;/P&gt;&lt;P&gt;set have;&lt;/P&gt;&lt;P&gt;by var1 var2 var3;&lt;/P&gt;&lt;P&gt;if first.var3 and last.var3 then output nodups;&lt;/P&gt;&lt;P&gt;else output dups;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 04 Jun 2015 06:38:41 GMT</pubDate>
    <dc:creator>AskoLötjönen</dc:creator>
    <dc:date>2015-06-04T06:38:41Z</dc:date>
    <item>
      <title>How to identify duplicates for data with IDs as a group of variables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199652#M4392</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;My data has IDs as a group of variables, let's say 3. It means each variable may not be unique individually, but together every 3 variables specifies a unique observation. It looks like this:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Var1&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Var2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Var3&lt;/P&gt;&lt;P&gt;John&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; PHIL&amp;nbsp;&amp;nbsp;&amp;nbsp; PA&lt;/P&gt;&lt;P&gt;Mike&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; PHIL&amp;nbsp;&amp;nbsp;&amp;nbsp; PA&lt;/P&gt;&lt;P&gt;John&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; CHIC&amp;nbsp;&amp;nbsp;&amp;nbsp; IL&lt;/P&gt;&lt;P&gt;John&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; PHIL&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; PA&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You will see that observations 1 and 4 are duplicates and there comes the question: How can I identify duplicates from this data?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If it's some single ID variable I can do:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data dups nodups;&lt;/P&gt;&lt;P&gt;set have;&lt;/P&gt;&lt;P&gt;by ID;&lt;/P&gt;&lt;P&gt;if first.ID and last.ID then output nodups;&lt;/P&gt;&lt;P&gt;else output dups;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But I'm not sure how to do in this situation. I can sort them out one by one by wonder if there's a more efficient way.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 04 Jun 2015 06:31:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199652#M4392</guid>
      <dc:creator>NonSleeper</dc:creator>
      <dc:date>2015-06-04T06:31:59Z</dc:date>
    </item>
    <item>
      <title>Re: How to identify duplicates for data with IDs as a group of variables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199653#M4393</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Just sort by all three vars:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc sort data=have;&lt;/P&gt;&lt;P&gt;&amp;nbsp; by var1 var2 var3;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data dups nodups;&lt;/P&gt;&lt;P&gt;set have;&lt;/P&gt;&lt;P&gt;by var1 var2 var3;&lt;/P&gt;&lt;P&gt;if first.var3 and last.var3 then output nodups;&lt;/P&gt;&lt;P&gt;else output dups;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 04 Jun 2015 06:38:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199653#M4393</guid>
      <dc:creator>AskoLötjönen</dc:creator>
      <dc:date>2015-06-04T06:38:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to identify duplicates for data with IDs as a group of variables</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199654#M4394</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Oh...Wao...Yeh...&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I think I'm gonna go home. Well, no : )&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 04 Jun 2015 06:42:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/How-to-identify-duplicates-for-data-with-IDs-as-a-group-of/m-p/199654#M4394</guid>
      <dc:creator>NonSleeper</dc:creator>
      <dc:date>2015-06-04T06:42:57Z</dc:date>
    </item>
  </channel>
</rss>

