<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Which is best in performance Creating Match Codes in Data Flux Vs Base SAS code Deploy on Server in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/502294#M15595</link>
    <description>&lt;P&gt;Match code generation is a very resource intensive process. It always uses DataFlux whether you call this functionality now out of SAS or directly out of a DF job so I don't believe going for a DF job will improve performance.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What you could do to improve end-to-end runtimes:&lt;/P&gt;
&lt;P&gt;1. Set-up parallel jobs each creating match-codes for a sub-set of your source&amp;nbsp;data&lt;/P&gt;
&lt;P&gt;2. Design and implement delta processing so don't re-create all match codes every single time but only create match codes for new or changed records.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 08 Oct 2018 01:55:14 GMT</pubDate>
    <dc:creator>Patrick</dc:creator>
    <dc:date>2018-10-08T01:55:14Z</dc:date>
    <item>
      <title>Which is best in performance Creating Match Codes in Data Flux Vs Base SAS code Deploy on Server</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/501714#M15581</link>
      <description>&lt;P&gt;Hi Everyone,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In our organisation we have to create match codes for huge to data(million records) for clustering.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can understand you can create match codes and clustering using Data Management Studio and schedule on DM Server.&lt;/P&gt;&lt;P&gt;Currently, the development has been done on SAS code(using DQmatch function) job and scheduled on the server.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't justify which one is best approach to improve performance on same server. Using SAS code is taking hours to run.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So I am just wondering has anyone experience similar situation or have any information around this.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Please any assistance would be really appreciated.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Rama&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Oct 2018 00:17:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/501714#M15581</guid>
      <dc:creator>Rama_V</dc:creator>
      <dc:date>2018-10-05T00:17:36Z</dc:date>
    </item>
    <item>
      <title>Re: Which is best in performance Creating Match Codes in Data Flux Vs Base SAS code Deploy on Server</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/502294#M15595</link>
      <description>&lt;P&gt;Match code generation is a very resource intensive process. It always uses DataFlux whether you call this functionality now out of SAS or directly out of a DF job so I don't believe going for a DF job will improve performance.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What you could do to improve end-to-end runtimes:&lt;/P&gt;
&lt;P&gt;1. Set-up parallel jobs each creating match-codes for a sub-set of your source&amp;nbsp;data&lt;/P&gt;
&lt;P&gt;2. Design and implement delta processing so don't re-create all match codes every single time but only create match codes for new or changed records.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Oct 2018 01:55:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/502294#M15595</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2018-10-08T01:55:14Z</dc:date>
    </item>
    <item>
      <title>Re: Which is best in performance Creating Match Codes in Data Flux Vs Base SAS code Deploy on Server</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/502566#M15600</link>
      <description>&lt;P&gt;Thanks Patrick.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I really appreciate your inputs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Oct 2018 02:19:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Which-is-best-in-performance-Creating-Match-Codes-in-Data-Flux/m-p/502566#M15600</guid>
      <dc:creator>Rama_V</dc:creator>
      <dc:date>2018-10-09T02:19:55Z</dc:date>
    </item>
  </channel>
</rss>

