<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data set keep -- anyway to speed up the code and procssing time? in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423860#M104274</link>
    <description>&lt;P&gt;Proc append or the append statement in proc datasets can be more efficient than the set statement in certain circumstances because it can use block i/o. Something like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc sql;
create table work.compiled_ip like raw.ccaes103;
quit;

proc datasets library=raw nolist;   
append base=work.compiled_ip data=ccaes103;
append base=work.compiled_ip data=ccaes113;
append base=work.compiled_ip data=ccaes122;
append base=work.compiled_ip data=ccaes132;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;(untested)&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 28 Dec 2017 07:16:35 GMT</pubDate>
    <dc:creator>PGStats</dc:creator>
    <dc:date>2017-12-28T07:16:35Z</dc:date>
    <item>
      <title>Data set keep -- anyway to speed up the code and procssing time?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423814#M104262</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm currently running&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;	data compiled_ip;
		set raw.ccaes103 raw.ccaes113 raw.ccaes122 raw.ccaes132; &lt;BR /&gt;		&lt;BR /&gt;		keep dx1 dobyr age svcdate enrolid;
	run;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;But each of the files is &amp;gt;35 GB and takes a long time. Any way I can make this more efficient? Already tried to reduce the size by selecting only the&amp;nbsp;5 essential variables.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Dec 2017 01:04:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423814#M104262</guid>
      <dc:creator>cdubs</dc:creator>
      <dc:date>2017-12-28T01:04:30Z</dc:date>
    </item>
    <item>
      <title>Re: Data set keep -- anyway to speed up the code and procssing time?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423816#M104263</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/174079"&gt;@cdubs&lt;/a&gt;&lt;/P&gt;
&lt;P&gt;You could try below syntax which will only load the desired variables into the PDV.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data compiled_ip;
  set 
    raw.ccaes103 (keep= dx1 dobyr age svcdate enrolid)
    raw.ccaes113 (keep= dx1 dobyr age svcdate enrolid) 
    raw.ccaes122 (keep= dx1 dobyr age svcdate enrolid) 
    raw.ccaes132 (keep= dx1 dobyr age svcdate enrolid)
  ;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;I would expect some performance improvement but not a big one.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;All other measure depend on your environment and could require quite a bit of coding and testing which is imho only worth doing if this is a production job on the critical path.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As the bottleneck is most likely disk or network I/O my first measure would be to ensure that your source data sets get stored compressed.&lt;/P&gt;</description>
      <pubDate>Thu, 28 Dec 2017 01:26:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423816#M104263</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2017-12-28T01:26:27Z</dc:date>
    </item>
    <item>
      <title>Re: Data set keep -- anyway to speed up the code and procssing time?</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423860#M104274</link>
      <description>&lt;P&gt;Proc append or the append statement in proc datasets can be more efficient than the set statement in certain circumstances because it can use block i/o. Something like:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc sql;
create table work.compiled_ip like raw.ccaes103;
quit;

proc datasets library=raw nolist;   
append base=work.compiled_ip data=ccaes103;
append base=work.compiled_ip data=ccaes113;
append base=work.compiled_ip data=ccaes122;
append base=work.compiled_ip data=ccaes132;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;(untested)&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Dec 2017 07:16:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Data-set-keep-anyway-to-speed-up-the-code-and-procssing-time/m-p/423860#M104274</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2017-12-28T07:16:35Z</dc:date>
    </item>
  </channel>
</rss>

