<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Identifying duplicates from two or more sets of data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46990#M9701</link>
    <description>PROC SORT has a DUPOUT= parameter so you can re-direct the duplicates to  a different file.  The other option, depending on your needs, is to use a DATA step, with a BY statement, and use the IF statement test for FIRST.ID and LAST.ID in order to perform some desired processing logic.&lt;BR /&gt;
&lt;BR /&gt;
Scott Barry&lt;BR /&gt;
SBBWorks, Inc.</description>
    <pubDate>Wed, 17 Sep 2008 16:17:30 GMT</pubDate>
    <dc:creator>sbb</dc:creator>
    <dc:date>2008-09-17T16:17:30Z</dc:date>
    <item>
      <title>Identifying duplicates from two or more sets of data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46989#M9700</link>
      <description>Hi. I'm trying to compare two sets of data. The common field that i'm using for comparison is the ID field. I'd like to be able to identify which ID's are duplicates. I think i could possibly use proc sort and nodupkey... but that would delete the observations instead of identifying them. &lt;BR /&gt;
i've used  proc compare but it only seems to compare the variables not the observations... any ideas? Here's my compare code:&lt;BR /&gt;
&lt;BR /&gt;
proc compare base = work.A compare = work.B;&lt;BR /&gt;
id IDCODE;&lt;BR /&gt;
run;</description>
      <pubDate>Wed, 17 Sep 2008 14:56:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46989#M9700</guid>
      <dc:creator>mitch</dc:creator>
      <dc:date>2008-09-17T14:56:05Z</dc:date>
    </item>
    <item>
      <title>Re: Identifying duplicates from two or more sets of data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46990#M9701</link>
      <description>PROC SORT has a DUPOUT= parameter so you can re-direct the duplicates to  a different file.  The other option, depending on your needs, is to use a DATA step, with a BY statement, and use the IF statement test for FIRST.ID and LAST.ID in order to perform some desired processing logic.&lt;BR /&gt;
&lt;BR /&gt;
Scott Barry&lt;BR /&gt;
SBBWorks, Inc.</description>
      <pubDate>Wed, 17 Sep 2008 16:17:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46990#M9701</guid>
      <dc:creator>sbb</dc:creator>
      <dc:date>2008-09-17T16:17:30Z</dc:date>
    </item>
    <item>
      <title>Re: Identifying duplicates from two or more sets of data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46991#M9702</link>
      <description>Thanks a lot! I ended up breaking into PROC SQL and joining the two datasets then using ODS to outsheet the dups. I'm playing with PROC SORT Dupout so I can know how to use it for future. &lt;BR /&gt;
&lt;BR /&gt;
I appreciate your suggestions.</description>
      <pubDate>Wed, 17 Sep 2008 21:42:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Identifying-duplicates-from-two-or-more-sets-of-data/m-p/46991#M9702</guid>
      <dc:creator>mitch</dc:creator>
      <dc:date>2008-09-17T21:42:35Z</dc:date>
    </item>
  </channel>
</rss>

