<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: how to remove duplicates in SAS Enterprise Miner in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450580#M6849</link>
    <description>&lt;P&gt;Thank you so much Mike,&lt;/P&gt;&lt;P&gt;Much appreciated.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Apr 2018 12:09:11 GMT</pubDate>
    <dc:creator>MoeYousefi</dc:creator>
    <dc:date>2018-04-03T12:09:11Z</dc:date>
    <item>
      <title>how to remove duplicates in SAS Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450225#M6845</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;I'm fairly new to SAS E-Miner and was just wondering if you could help me out with my query of " how to eliminate duplicated records in SAS E-Miner?"&lt;/P&gt;&lt;P&gt;Many thanks,&lt;/P&gt;&lt;P&gt;Moe&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 01 Apr 2018 04:40:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450225#M6845</guid>
      <dc:creator>MoeYousefi</dc:creator>
      <dc:date>2018-04-01T04:40:49Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove duplicates in SAS Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450408#M6846</link>
      <description>&lt;P&gt;Hello MoeYousefi-&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Enterprise Miner does not have specific functionality for removing duplicate observations.&amp;nbsp; However, you can run a SAS Code node and invoke PROC SORT with the NODUPKEY option.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;Example:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;- Add a SAS Code node to your flow.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;- Select the Code Editor property.&amp;nbsp; Enter code like this:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; proc sort nodupkey data=&amp;amp;EM_IMPORT_DATA out=&amp;amp;EM_EXPORT_TRAIN;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;var &amp;lt; list of variables that define unique vs duplicate &amp;gt;;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;- Close the node.&amp;nbsp; Run the node.&amp;nbsp; Continue your flow.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;The NODUPKEY option tells PROC SORT to keep only unique rows as defined by the variables on the VAR statement.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp; &amp;nbsp;&lt;SPAN&gt;&amp;amp;EM_IMPORT_DATA is a SAS Code node macro variable that resolves to the data source that is coming in to the SAS Code node.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp;&amp;amp;EM_EXPORT_TRAIN is a SAS Code node macro variable that resolves to the data source that is created by the SAS Code node.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;There is no real advantage to running PROC SORT in a SAS Code node in this specific scenario.&amp;nbsp; In fact, you might be better served by running PROC SORT in the coding job that prepares the data set for use in Enterprise Miner.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Have a great week!&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Apr 2018 17:05:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450408#M6846</guid>
      <dc:creator>MikeStockstill</dc:creator>
      <dc:date>2018-04-02T17:05:56Z</dc:date>
    </item>
    <item>
      <title>Re: how to remove duplicates in SAS Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450580#M6849</link>
      <description>&lt;P&gt;Thank you so much Mike,&lt;/P&gt;&lt;P&gt;Much appreciated.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Apr 2018 12:09:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/how-to-remove-duplicates-in-SAS-Enterprise-Miner/m-p/450580#M6849</guid>
      <dc:creator>MoeYousefi</dc:creator>
      <dc:date>2018-04-03T12:09:11Z</dc:date>
    </item>
  </channel>
</rss>

