<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to get duplicate records as output data set using proc sql only? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647418#M193775</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/304446"&gt;@Saikiran_Mamidi&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;In proc sql only ?&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Why be stupid? Isn't it much better to do the clever thing?&lt;/P&gt;</description>
    <pubDate>Wed, 13 May 2020 11:35:15 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2020-05-13T11:35:15Z</dc:date>
    <item>
      <title>How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647386#M193751</link>
      <description>&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input id;
cards;
1
1
1
2
2
44
44
44
44
;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;How can I get the duplicate records into an output data set using only PROC SQL? The output I want looks like this:&lt;/P&gt;
&lt;PRE&gt;id
1
1
2
44
44
44&lt;/PRE&gt;
&lt;P&gt;Can I get code for this? Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 10:25:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647386#M193751</guid>
      <dc:creator>Saikiran_Mamidi</dc:creator>
      <dc:date>2020-05-13T10:25:47Z</dc:date>
    </item>
    <item>
      <title>How to get duplicate records as output dataset without using proc sort? only using proc sql?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647385#M193755</link>
      <description>&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data have;
input id;
cards;
1
1
1
2
2
44
44
44
44
;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;How can I get the duplicate records into an output data set using only PROC SQL? The output I want looks like this:&lt;/P&gt;
&lt;PRE&gt;id
1
1
2
44
44
44&lt;/PRE&gt;
&lt;P&gt;Can I get code for this? Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 10:22:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647385#M193755</guid>
      <dc:creator>Saikiran_Mamidi</dc:creator>
      <dc:date>2020-05-13T10:22:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647388#M193753</link>
      <description>You still have duplicates in WANT, don't you?&lt;BR /&gt;&lt;BR /&gt;Bart</description>
      <pubDate>Wed, 13 May 2020 10:29:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647388#M193753</guid>
      <dc:creator>yabwon</dc:creator>
      <dc:date>2020-05-13T10:29:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647392#M193758</link>
      <description>&lt;P&gt;What if, in addition to variable ID, you have other "satellite" variables A, B, and C, which vary within a given ID?&amp;nbsp; What criterion would you use to select which records are the extra ones?&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 10:32:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647392#M193758</guid>
      <dc:creator>mkeintz</dc:creator>
      <dc:date>2020-05-13T10:32:26Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output dataset without using proc sort? only using proc sql?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647393#M193759</link>
      <description>&lt;P&gt;Why limit yourself to SQL?&lt;/P&gt;
&lt;P&gt;What have you tried so far?&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 10:34:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647393#M193759</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2020-05-13T10:34:13Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output dataset without using proc sort? only using proc sql?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647396#M193761</link>
      <description>&lt;P&gt;Something like this:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;Data have;
input id;
cards;
1
1
1
2
2
44
44
44
44
; run;


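proc sql;
  /* number the rows; monotonic() is undocumented, so treat this as a sketch */
  create table tmp as
  select id, monotonic() as i from have
  order by id
  ;

  /* keep every row except the first one in each id group */
  create table want as
  select a.id
  from
  tmp as a
  group by a.id
  having a.i &amp;gt; min(a.i)
  ;

  /* PROC SQL removes a table with DROP TABLE, not DELETE TABLE */
  drop table tmp;
quit;&lt;/CODE&gt;&lt;/PRE&gt;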
&lt;P&gt;Not optimal, I'd say.&lt;/P&gt;
&lt;P&gt;Bart&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 10:41:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647396#M193761</guid>
      <dc:creator>yabwon</dc:creator>
      <dc:date>2020-05-13T10:41:04Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647413#M193771</link>
      <description>&lt;P&gt;Not using a data step for this is outright stupid:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data want;
set have;
by id;
if not first.id; /* keep every record after the first in each id group; ids that occur only once are dropped */
run;&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 13 May 2020 11:28:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647413#M193771</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-05-13T11:28:02Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647415#M193773</link>
      <description>In PROC SQL only?</description>
      <pubDate>Wed, 13 May 2020 11:30:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647415#M193773</guid>
      <dc:creator>Saikiran_Mamidi</dc:creator>
      <dc:date>2020-05-13T11:30:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647418#M193775</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/304446"&gt;@Saikiran_Mamidi&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;In proc sql only ?&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Why be stupid? Isn't it much better to do the clever thing?&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 11:35:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647418#M193775</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-05-13T11:35:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to get duplicate records as output data set using proc sql only?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647454#M193791</link>
      <description>&lt;P&gt;Well, if HAVE has 600 million records but only 1 million unique key values, you would want to avoid sorting the 600 million records.&lt;/P&gt;
&lt;P&gt;In that scenario, you could read HAVE sequentially just once and use a data step with a hash object to look up whether each key has already been seen.&lt;/P&gt;
&lt;P&gt;If it has not, insert the record into the hash table as the first match for that key (of course, the non-key variables saved from that record might not be the ones you want).&lt;/P&gt;
&lt;P&gt;Then, at the end, you output the hash table to a physical table.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You need to ensure that the million full rows will fit into memory. If that is not possible, then create a macro that reads HAVE in n pieces using the FIRSTOBS= and OBS= options.&lt;/P&gt;
&lt;P&gt;Then you need a final part that re-reads the reduced pieces and consolidates them into a final WANT.&lt;/P&gt;
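&lt;P&gt;A minimal sketch of the single-pass hash idea described above, assuming ID is the only key variable and the distinct keys fit in memory. The names SEEN, FIRST_PER_ID and DUPS are illustrative and not from the original post. The first step writes one row per key from the hash table; the second is a variation that writes every repeated row instead, which is what the question asked for.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Step 1: keep the first row per key in a hash table, then write it out */
data _null_;
  declare hash seen(ordered: 'a');   /* ordered: 'a' writes keys in ascending order */
  seen.defineKey('id');
  seen.defineData('id');
  seen.defineDone();
  do until (done);
    set have end=done;
    if seen.check() ne 0 then seen.add();   /* key not seen yet: store it */
  end;
  seen.output(dataset: 'first_per_id');
  stop;
run;

/* Variation: write each row whose key was already seen (no sort needed) */
data dups;
  if _n_ = 1 then do;
    declare hash seen();
    seen.defineKey('id');
    seen.defineDone();
  end;
  set have;
  if seen.check() = 0 then output;   /* repeated key: this row is a duplicate */
  else seen.add();                   /* first occurrence: remember the key */
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;With the sample HAVE from the question, DUPS would hold the six rows 1, 1, 2, 44, 44, 44.&lt;/P&gt;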
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 13 May 2020 13:11:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/How-to-get-duplicate-records-as-output-data-set-using-proc-sql/m-p/647454#M193791</guid>
      <dc:creator>DavePrinsloo</dc:creator>
      <dc:date>2020-05-13T13:11:08Z</dc:date>
    </item>
  </channel>
</rss>

