<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Will data step automatically remove duplicates in SAS Enterprise Guide</title>
    <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Will-data-step-automatically-remove-duplicates/m-p/156871#M12212</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;When I write:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data work.temp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; set work.temp1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; work.temp2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; work.temp3;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; keep col1 col2 col3 col4 col5; &lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Will it only keep unique rows in the final output or will it keep duplicates? Also how is the above code different from proc append?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 13 May 2014 14:44:05 GMT</pubDate>
    <dc:creator>eagles_dare13</dc:creator>
    <dc:date>2014-05-13T14:44:05Z</dc:date>
    <item>
      <title>Will data step automatically remove duplicates</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Will-data-step-automatically-remove-duplicates/m-p/156871#M12212</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;When I write:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;data work.temp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; set work.temp1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; work.temp2&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; work.temp3;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp; keep col1 col2 col3 col4 col5; &lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Will it only keep unique rows in the final output or will it keep duplicates? Also how is the above code different from proc append?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 13 May 2014 14:44:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Will-data-step-automatically-remove-duplicates/m-p/156871#M12212</guid>
      <dc:creator>eagles_dare13</dc:creator>
      <dc:date>2014-05-13T14:44:05Z</dc:date>
    </item>
    <item>
      <title>Re: Will data step automatically remove duplicates</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Will-data-step-automatically-remove-duplicates/m-p/156872#M12213</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;No, it will not.&amp;nbsp; Use proc sort with nodupkey, or SQL distinct to get rid of duplicates.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You code will create a new file, open each in the set and write out to the new file.&amp;nbsp; Not very read/write optimized.&amp;nbsp; Proc append doesn't open the file, just sets them together so less I/O operation.&amp;nbsp; The proc append will throw warnings if&amp;nbsp; the dataset formats are not exact or if there are more variables.&amp;nbsp; Dataset will warn on some things, but will expand the output table with any new columns without warning.&amp;nbsp; Best check the guidance for complete run down as there are plus/minus points on each/&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 13 May 2014 14:49:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Will-data-step-automatically-remove-duplicates/m-p/156872#M12213</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2014-05-13T14:49:08Z</dc:date>
    </item>
  </channel>
</rss>

