<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Export big data(Billion + rows) from SAS table to disk in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459644#M284582</link>
    <description>&lt;P&gt;I am accessing data on remote SAS serer using SAS EG , the SAS Table is stored in a Sybase DB on server.&lt;/P&gt;&lt;P&gt;Export to csv or tab separated.&lt;/P&gt;&lt;P&gt;My end goal is to read the data in python program for applying machine learning&lt;/P&gt;</description>
    <pubDate>Thu, 03 May 2018 13:22:44 GMT</pubDate>
    <dc:creator>Pranav_</dc:creator>
    <dc:date>2018-05-03T13:22:44Z</dc:date>
    <item>
      <title>Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459493#M284579</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I want to export a table with billion + rows to my local from SAS EG.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was wondering if there is anyway that the data can be zipped or compressed first and then the data is exported to local to reduce the I/O. Any suggestions?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 02 May 2018 21:01:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459493#M284579</guid>
      <dc:creator>Pranav_</dc:creator>
      <dc:date>2018-05-02T21:01:49Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459500#M284580</link>
      <description>&lt;P&gt;Where are you processing this data now? On local SAS or on remote server SAS? Where is the SAS Table stored? And what to you mean by export - export to what file format or just copy as SAS table?&lt;/P&gt;</description>
      <pubDate>Wed, 02 May 2018 22:14:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459500#M284580</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2018-05-02T22:14:42Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459599#M284581</link>
      <description>&lt;P&gt;Valid questions by &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13976"&gt;@SASKiwi&lt;/a&gt;.&lt;/P&gt;
&lt;P&gt;Look at proc export.&lt;/P&gt;
&lt;P&gt;It's quite old, so I am unsure how it compares with zipping in terms of file size.&lt;/P&gt;
&lt;P&gt;It's very handy for whole libraries, but you may be better off just zipping if it's only one file and the sides have the same OS.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 10:35:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459599#M284581</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-05-03T10:35:59Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459644#M284582</link>
      <description>&lt;P&gt;I am accessing data on remote SAS serer using SAS EG , the SAS Table is stored in a Sybase DB on server.&lt;/P&gt;&lt;P&gt;Export to csv or tab separated.&lt;/P&gt;&lt;P&gt;My end goal is to read the data in python program for applying machine learning&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 13:22:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459644#M284582</guid>
      <dc:creator>Pranav_</dc:creator>
      <dc:date>2018-05-03T13:22:44Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459650#M284583</link>
      <description>&lt;P&gt;Hi &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/16961"&gt;@ChrisNZ&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for the input&lt;/P&gt;&lt;P&gt;I checked the proc export. It does not provide any option to compress file before export. As the table is massive the plain export will take significantly longer and might get time out&amp;nbsp;as well.&lt;/P&gt;&lt;P&gt;I am working in an Enterprise so the default security and login restrictions apply&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 13:33:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459650#M284583</guid>
      <dc:creator>Pranav_</dc:creator>
      <dc:date>2018-05-03T13:33:22Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459853#M284584</link>
      <description>&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&amp;gt;It does not provide any option to compress file before export.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;It always compresses.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;So the process is&amp;nbsp;Sybase =&amp;gt; CSV =&amp;gt; Python script?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;SAS has nothing to do with this process then.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;The only value that can be added is transferring with SAS/ACCESS, but for that kind of volume and considering&amp;nbsp;the end-result is CSV, it is probably a lot more efficient to ask the sybase admins to dump the table to a text file, zip it, transfer it&amp;nbsp; and unzip it.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 21:09:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459853#M284584</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-05-03T21:09:22Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459855#M284585</link>
      <description>&lt;P&gt;Hi &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/16961"&gt;@ChrisNZ&lt;/a&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The access to Sybase db is though SAS. Unfortunately going directly to admins is not an possible option.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I was wondering/curious in case there is any functionality provided by sas to compress the table to zip or tarz(any other universal format)&amp;nbsp; before export&amp;nbsp;to local(disk) to reduce IO&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 21:15:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459855#M284585</guid>
      <dc:creator>Pranav_</dc:creator>
      <dc:date>2018-05-03T21:15:42Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459866#M284586</link>
      <description>&lt;P&gt;You could use PROC EXPORT, but that doesn't provide many tuning options.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You could use PROC SQL, which works a little bit closer to the database.&amp;nbsp; And if you have SAS 9.4 M5, you can GZIP the output "on the fly."&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;ods _all_ close;
filename out ZIP "/u/myaccount/project/table.csv.gz" GZIP;
ods csv file=out;
proc sql;
 select * from sashelp.class;
quit;
ods csv close;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;ODS CSV might not be the fastest at writing the output, but at least the final result should be compressed.&amp;nbsp; It's worth testing with just a subset at first and see if it works for you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 21:41:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459866#M284586</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2018-05-03T21:41:36Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459885#M284587</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/4"&gt;@ChrisHemedinger&lt;/a&gt;'s solution is probably the best you can do.&lt;/P&gt;
&lt;P&gt;Just point proc sql to your Sybase data and download.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The one change I would make is split the download.&lt;/P&gt;
&lt;P&gt;You'll get disconnections or errors, you don't want to start over.&lt;/P&gt;
&lt;P&gt;Just make your python script scan several files instead of one if you can.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can split by a variable of your choice, hopefully a refresh or capture date of some sort&lt;/P&gt;</description>
      <pubDate>Thu, 03 May 2018 22:41:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/459885#M284587</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-05-03T22:41:17Z</dc:date>
    </item>
    <item>
      <title>Re: Export big data(Billion + rows) from SAS table to disk</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/460039#M284588</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/4"&gt;@ChrisHemedinger&lt;/a&gt;&amp;nbsp;Thank you! seems like a viable option &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 04 May 2018 13:57:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Export-big-data-Billion-rows-from-SAS-table-to-disk/m-p/460039#M284588</guid>
      <dc:creator>Pranav_</dc:creator>
      <dc:date>2018-05-04T13:57:37Z</dc:date>
    </item>
  </channel>
</rss>

