<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: load dataset from client to CAS in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842804#M333261</link>
    <description>&lt;P&gt;I strongly suspect that the poor performance is caused by large default character column lengths on the SAS dataset read from SAP/BW, which make the dataset much larger than it needs to be. In SAS 9.4 libraries on disk this is not a problem, because the SAS option COMPRESS removes the extra space efficiently. Compressing in-memory tables is not advisable, as it adversely affects performance. I know it is tedious, but you can shorten all long character columns with the LENGTH statement to reduce the size of the on-disk dataset, which should improve load times into memory a lot.&lt;/P&gt;</description>
    <pubDate>Mon, 07 Nov 2022 02:13:39 GMT</pubDate>
    <dc:creator>SASKiwi</dc:creator>
    <dc:date>2022-11-07T02:13:39Z</dc:date>
    <item>
      <title>load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/833056#M329310</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;I have a dataset with 16,000,000 records and 136 columns. I load this dataset into CAS every day. My procedure is this:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data hseq.HSEQ_INSP_FIND_TEMP1;
   set staging.HSEQ_INSP_FIND_TEMP;
run;

proc casutil incaslib="hseq" outcaslib="hseq";
   droptable casdata="HSEQ_INSP_FIND_TEMP" quiet;
   promote casdata="HSEQ_INSP_FIND_TEMP1" casout="HSEQ_INSP_FIND_TEMP";
   save casdata="HSEQ_INSP_FIND_TEMP" casout="HSEQ_INSP_FIND_TEMP" replace;
quit;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But it is very slow, taking over two hours.&lt;BR /&gt;Is there another way to load a dataset into CAS?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A.&lt;/P&gt;</description>
      <pubDate>Tue, 13 Sep 2022 08:16:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/833056#M329310</guid>
      <dc:creator>dipand</dc:creator>
      <dc:date>2022-09-13T08:16:49Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837251#M331036</link>
      <description>Hi, &lt;BR /&gt;I don't know if you still need help, but a different way to load data into CAS is to use SAS DI Studio jobs scheduled outside working hours (if possible).&lt;BR /&gt;&lt;BR /&gt;Alternatively, you can use PROC FEDSQL to load the data, but both source and destination have to be in caslibs. After the upload you can use PROC CASUTIL or PROC CAS to promote the table.&lt;BR /&gt;Regards,&lt;BR /&gt;Smm662002</description>
      <pubDate>Thu, 06 Oct 2022 19:21:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837251#M331036</guid>
      <dc:creator>smm662002</dc:creator>
      <dc:date>2022-10-06T19:21:58Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837321#M331051</link>
      <description>&lt;P&gt;Unfortunately my dataset is on SAS 9.4 and I must load it into CAS, because I read this data from SAP/BW. The SAS job starts at a different time each day, after the SAP/BW job finishes. The time to load this dataset is very long, and I have not been able to improve the performance.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Oct 2022 06:36:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837321#M331051</guid>
      <dc:creator>dipand</dc:creator>
      <dc:date>2022-10-07T06:36:29Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837608#M331202</link>
      <description>&lt;P&gt;This post has some suggestions on how to trigger a parallel load from SAS datasets:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://blogs.sas.com/content/sgf/2019/10/17/how-to-parallel-load-and-compress-a-sas-cloud-analytic-services-cas-table/" target="_blank"&gt;https://blogs.sas.com/content/sgf/2019/10/17/how-to-parallel-load-and-compress-a-sas-cloud-analytic-services-cas-table/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your process is critical, and you have an MPP CAS, you can always explore these options:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/4-Approaches-for-Parallel-Data-Loading-to-CAS/ta-p/370955" target="_blank"&gt;https://communities.sas.com/t5/SAS-Communities-Library/4-Approaches-for-Parallel-Data-Loading-to-CAS/ta-p/370955&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 06:30:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837608#M331202</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2022-10-10T06:30:15Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837626#M331209</link>
      <description>&lt;P&gt;Just a comment on the first link&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13674"&gt;@LinusH&lt;/a&gt;&amp;nbsp;shared, an article from 2019: in my experience, compressing a CAS table (using &lt;EM&gt;compress=true&lt;/EM&gt;) had a negative impact on query performance.&lt;/P&gt;
&lt;P&gt;If you're on a very recent Viya version then use&amp;nbsp;&lt;EM&gt;memoryFormat="DVR"&lt;/EM&gt; instead.&amp;nbsp;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Viya-2020-1-CAS-Duplicate-Value-Reduction/ta-p/707049" target="_blank"&gt;https://communities.sas.com/t5/SAS-Communities-Library/Viya-2020-1-CAS-Duplicate-Value-Reduction/ta-p/707049&lt;/A&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Oct 2022 09:58:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/837626#M331209</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2022-10-10T09:58:56Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842798#M333257</link>
      <description>&lt;P&gt;I have seen this method load faster.&amp;nbsp; You could try it instead of the DATA step.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
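&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Resolve the physical directory behind the STAGING libref */
%let path=%sysfunc(pathname(staging));
proc cas;
   /* Send the .sas7bdat file itself to CAS instead of copying rows with a DATA step */
   table.upload / path="&amp;amp;path/HSEQ_INSP_FIND_TEMP.sas7bdat" casout={caslib="hseq" name="HSEQ_INSP_FIND_TEMP1"};
quit;&lt;/CODE&gt;&lt;/PRE&gt;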
&lt;P&gt;Out of the 2 hours, how much of that time is the actual load step?&lt;/P&gt;
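&lt;P&gt;One way to check this, sketched here with the DATA step from the original post, is to turn on the FULLSTIMER system option so that the log reports real and CPU time for each step separately. Comparing the real time of the load step with the PROC CASUTIL step that follows should show where most of the two hours is spent.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Write detailed timing statistics for every step to the SAS log */
options fullstimer;

/* The load step from the original post; its NOTE in the log shows the real time used */
data hseq.HSEQ_INSP_FIND_TEMP1;
   set staging.HSEQ_INSP_FIND_TEMP;
run;&lt;/CODE&gt;&lt;/PRE&gt;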
&lt;P&gt;And are all of the 136 columns needed?&amp;nbsp; Can you drop any?&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2022 00:21:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842798#M333257</guid>
      <dc:creator>DerylHollick</dc:creator>
      <dc:date>2022-11-07T00:21:38Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842804#M333261</link>
      <description>&lt;P&gt;I strongly suspect that the poor performance is caused by large default character column lengths on the SAS dataset read from SAP/BW, which make the dataset much larger than it needs to be. In SAS 9.4 libraries on disk this is not a problem, because the SAS option COMPRESS removes the extra space efficiently. Compressing in-memory tables is not advisable, as it adversely affects performance. I know it is tedious, but you can shorten all long character columns with the LENGTH statement to reduce the size of the on-disk dataset, which should improve load times into memory a lot.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2022 02:13:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842804#M333261</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2022-11-07T02:13:39Z</dc:date>
    </item>
    <item>
      <title>Re: load dataset from client to CAS</title>
      <link>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842823#M333269</link>
      <description>My customer does not want any columns deleted, but I will try your instructions. Thank you so much.</description>
      <pubDate>Mon, 07 Nov 2022 06:13:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/load-dataset-from-client-to-CAS/m-p/842823#M333269</guid>
      <dc:creator>dipand</dc:creator>
      <dc:date>2022-11-07T06:13:55Z</dc:date>
    </item>
  </channel>
</rss>

