<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Performance Management/Fine tuning of DataFlux Jobs in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140263#M2163</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Shawn&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have put that "Lookup Map..."&amp;nbsp; in front of branch. Its executing a bit faster. Anything apart from that which can be implemented, such as changing any of the advanced properties or the configuration files??&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 16 Jan 2014 08:31:24 GMT</pubDate>
    <dc:creator>Abhigyan</dc:creator>
    <dc:date>2014-01-16T08:31:24Z</dc:date>
    <item>
      <title>Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140259#M2159</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi there!!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have created a complex job which branches out to around 10 flows. It starts with a SQL read node which is reading data from a SQL Server database. The table from which data is being read contains around 3.2 million records. A filter has been added and around 1.6 million records are being written. After reading, the data is being processed for quality.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;All these is taking up a lot of time - approximately &lt;STRONG&gt;20 minutes for reading the data&lt;/STRONG&gt; and around&lt;STRONG&gt; 4 hours for processing the data and completing the job&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is there any way that DataFlux can be configured to complete jobs/read and process data at a much faster way? Can any configurations be done in order to fine tune the jobs?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have attached a screenshot to illustrate the time the job is taking.&lt;/P&gt;&lt;BR /&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/11800iFCACF7BCAD023C28/image-size/large?v=1.0&amp;amp;px=600" border="0" alt="Time.png" title="Time.png" /&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Jan 2014 11:59:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140259#M2159</guid>
      <dc:creator>Abhigyan</dc:creator>
      <dc:date>2014-01-14T11:59:12Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140260#M2160</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Abhigyan,&lt;/P&gt;&lt;P&gt;Can you provide a sample of the logic you are executing following the branch node?&lt;/P&gt;&lt;P&gt;Are you doing any filtering in the 'Add GAR...' expression node? If so add that logic to the Data Input node.&lt;/P&gt;&lt;P&gt;What is being done in the embedded jobs?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Give us some additional details and we can help you further.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Preformance can also be impacted by the hardware capabilities of the machine and should be a consideration as well.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-shawn&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 14 Jan 2014 14:30:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140260#M2160</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-01-14T14:30:21Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140261#M2161</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey Shawn&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks a lot for the reply.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Following the branch node, i am just adding a few columns and passing some values to those columns. In the "Add GAR.." expression node, I am adding a column. No filtering over there.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In the embedded jobs, I am using a data job to check for a certain pattern using a if else statements.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;if(ascii(x)==49 or ascii(x)==50 . . . . . .&amp;nbsp; or ascii(x)=57)&lt;/P&gt;&lt;P&gt;return true&lt;/P&gt;&lt;P&gt;else&lt;/P&gt;&lt;P&gt;return false&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Jan 2014 06:02:14 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140261#M2161</guid>
      <dc:creator>Abhigyan</dc:creator>
      <dc:date>2014-01-15T06:02:14Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140262#M2162</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Abhigyan,&lt;/P&gt;&lt;P&gt;Is the "Lookup map i..." node the exact same, just repeated 14 times? If so can you add that expression logic into the "Add GAR c..." node and put the branch after that? What your job is doing is taking all 3.2 million records and evaluating the "Lookup map i..." 14 times. Which is 3.2m records * 44.8m records to process. Moving the "Lookup map i..." logic into the "Add GAR c..." node reduces that overhead. Also you can combine some of the "Global Accou..." and "Accounting Re..." nodes and add branches after those to reduce processing.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-shawn&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 15 Jan 2014 21:17:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140262#M2162</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-01-15T21:17:00Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140263#M2163</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Shawn&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have put that "Lookup Map..."&amp;nbsp; in front of branch. Its executing a bit faster. Anything apart from that which can be implemented, such as changing any of the advanced properties or the configuration files??&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 16 Jan 2014 08:31:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140263#M2163</guid>
      <dc:creator>Abhigyan</dc:creator>
      <dc:date>2014-01-16T08:31:24Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140264#M2164</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The Branch node has a memory cache size option that you can increase which could help. What node are you using to output data? If you are outputting data to a database you can change the commit frequency in the options to commit every 100,000 records (as an example) instead of committing every row which is the default.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-shawn&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 16 Jan 2014 13:24:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140264#M2164</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-01-16T13:24:51Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140265#M2165</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Shawn&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Will definitely try increasing the branch memory cache size option. Also will change the commit frequency and check.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;BTW, for our job, we are writing to the DB using an expression where we are using the Expression Engine Language and DSNs to write data.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 16 Jan 2014 14:26:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140265#M2165</guid>
      <dc:creator>Abhigyan</dc:creator>
      <dc:date>2014-01-16T14:26:29Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140266#M2166</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;A _jive_internal="true" href="https://communities.sas.com/people/skillman"&gt;Skillman&lt;/A&gt;, thank you for joining the conversation and providing such a helpful tip. &lt;SPAN class="j-post-author "&gt;&lt;STRONG&gt;&lt;A _jive_internal="true" class="jiveTT-hover-user jive-username-link" data-avatarid="-1" data-externalid="" data-presence="null" data-userid="826574" data-username="Abhigyan" href="https://communities.sas.com/people/Abhigyan" id="jive-8265745263930948308711"&gt;Abhigyan&lt;/A&gt;&lt;/STRONG&gt;&lt;/SPAN&gt;, try increasing the branch memory cache size option and let us know how it works for you! &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-Anna-Marie &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 16 Jan 2014 15:03:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140266#M2166</guid>
      <dc:creator>anna_holland</dc:creator>
      <dc:date>2014-01-16T15:03:26Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140267#M2167</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Abhigyan,&lt;/P&gt;&lt;P&gt;The Expression DSN may be opening and closing the cursor for each record you wish to write to the database. This could definitely be your bottleneck.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-shawn&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 16 Jan 2014 16:23:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140267#M2167</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-01-16T16:23:01Z</dc:date>
    </item>
    <item>
      <title>Re: Performance Management/Fine tuning of DataFlux Jobs</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140268#M2168</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks Shawn. That is a good point you have raised. I will come back to you by implementing the changes.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 17 Jan 2014 07:30:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Performance-Management-Fine-tuning-of-DataFlux-Jobs/m-p/140268#M2168</guid>
      <dc:creator>Abhigyan</dc:creator>
      <dc:date>2014-01-17T07:30:37Z</dc:date>
    </item>
  </channel>
</rss>

