<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic how to improve update performance in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175119#M3514</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;hi all,&lt;/P&gt;&lt;P&gt;I am working with sas data management studio.In that i was using update node in one of my job.My target node is having 65 lakh records and my transaction is having 74000 records. when i execute update node with following query&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;update master a&lt;/P&gt;&lt;P&gt;set a.action_status=( select b.action_status from trans b where a.cust_id=b.cust_id)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;when i ran the update node it is running for 1 1/2 day...and it is still running for the second day also..&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;can anyone help me on this and also share your views to increase the performance..&lt;/P&gt;&lt;P&gt;MANY THANKS IN ADVANCE.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 10 Apr 2014 06:41:28 GMT</pubDate>
    <dc:creator>sandeep_reddy</dc:creator>
    <dc:date>2014-04-10T06:41:28Z</dc:date>
    <item>
      <title>how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175119#M3514</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;hi all,&lt;/P&gt;&lt;P&gt;I am working with sas data management studio.In that i was using update node in one of my job.My target node is having 65 lakh records and my transaction is having 74000 records. when i execute update node with following query&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;update master a&lt;/P&gt;&lt;P&gt;set a.action_status=( select b.action_status from trans b where a.cust_id=b.cust_id)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;when i ran the update node it is running for 1 1/2 day...and it is still running for the second day also..&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;can anyone help me on this and also share your views to increase the performance..&lt;/P&gt;&lt;P&gt;MANY THANKS IN ADVANCE.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 06:41:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175119#M3514</guid>
      <dc:creator>sandeep_reddy</dc:creator>
      <dc:date>2014-04-10T06:41:28Z</dc:date>
    </item>
    <item>
      <title>Re: how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175120#M3515</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hash Table.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 08:52:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175120#M3515</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2014-04-10T08:52:19Z</dc:date>
    </item>
    <item>
      <title>Re: how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175121#M3516</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Index the fields you are joining and the action_status field.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 10:23:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175121#M3516</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-04-10T10:23:58Z</dc:date>
    </item>
    <item>
      <title>Re: how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175122#M3517</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I don't have SAS Data Management Studio but in from your code I have one idea what could be the problem:&lt;/P&gt;&lt;P&gt;your update is making select from table b for every row in table a which means 6,5 million times a select from (74000 rows inner join with 6,5 million rows) with a answer of 1 action status.&lt;/P&gt;&lt;P&gt;this can't be fast!&lt;/P&gt;&lt;P&gt;However you could make a filter to limit the rows in a big table like this&lt;/P&gt;&lt;P&gt;update master a&lt;/P&gt;&lt;P&gt;set a.action_status=(select b.action_status from trans b where a.cust_id=b.cus_id)&lt;/P&gt;&lt;P&gt;where a.cust_id IN (select DISTINCT cust_id from trans)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;or you rewrite it to be a data step:&lt;/P&gt;&lt;P&gt;data master;&lt;/P&gt;&lt;P&gt;set master trans;&lt;/P&gt;&lt;P&gt;BY cust_id;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;However, the second option had a need that data are sorted by cust_id. If you build an index by cust_id this could help also.&lt;/P&gt;&lt;P&gt;Just few thoughts&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 11:30:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175122#M3517</guid>
      <dc:creator>ajuvaba</dc:creator>
      <dc:date>2014-04-10T11:30:07Z</dc:date>
    </item>
    <item>
      <title>Re: how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175123#M3518</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Many attention points to check and evaluate as there is too little background information.&amp;nbsp; &lt;/P&gt;&lt;P&gt; I = ksharp, although this will need a special user-transformation for DI&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; The hash approach could be extended to a datastep vs SQL approach&amp;nbsp; &lt;/P&gt;&lt;P&gt;II = skillman optimizing datasets within SQL behavior &lt;/P&gt;&lt;P&gt;III = Change some SAS system options for better performance&lt;/P&gt;&lt;P&gt;More? &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 11:34:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175123#M3518</guid>
      <dc:creator>jakarman</dc:creator>
      <dc:date>2014-04-10T11:34:09Z</dc:date>
    </item>
    <item>
      <title>Re: how to improve update performance</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175124#M3519</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Are you using the Data Update node or the SQL Execute node? If you are not passing data to the data update node, and using database tables to determine the records to update (which it looks like you are doing based on the information provided) then I would use the SQL Execute node to push this update to the database and take advantage of it's memory/cpu.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Apr 2014 12:55:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/how-to-improve-update-performance/m-p/175124#M3519</guid>
      <dc:creator>skillman</dc:creator>
      <dc:date>2014-04-10T12:55:11Z</dc:date>
    </item>
  </channel>
</rss>

