<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Can I able to run a dataset with a size of 5.5TB in SAS Enterprise Guide</title>
    <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725230#M38321</link>
    <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/143413"&gt;@Sairampulipati&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Team,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have scenario where am expecting a dataset with size approximately equal to 5.5 TB. My concern is using SAS EG 7.1 or 8.2 can I able to analyze this 5.5 TB sized dataset? If yes, how much time will it take to run a simple proc freq code or so.. If there is any complex code how much time does it take run.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in Advance.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards,&lt;/P&gt;
&lt;P&gt;Sairam&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;What ever you do, it may help to explicitly reduce the number of variables if they are not needed.&lt;/P&gt;
&lt;P&gt;If by simple you meant:&lt;/P&gt;
&lt;PRE&gt;Proc freq data=bigset;
run;&lt;/PRE&gt;
&lt;P&gt;is suspect you may run out of results window space if you have many levels of many varibles. You can reduce the memory use a bit with something like this to only the variables of interest/ needed for any analysis.&lt;/P&gt;
&lt;PRE&gt;Proc freq data=bigset (keep= var1 var2 var5 );
run;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 10 Mar 2021 18:54:43 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2021-03-10T18:54:43Z</dc:date>
    <item>
      <title>Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725216#M38316</link>
      <description>&lt;P&gt;Team,&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have scenario where am expecting a dataset with size approximately equal to 5.5 TB. My concern is using SAS EG 7.1 or 8.2 can I able to analyze this 5.5 TB sized dataset? If yes, how much time will it take to run a simple proc freq code or so.. If there is any complex code how much time does it take run.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in Advance.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Sairam&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 17:50:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725216#M38316</guid>
      <dc:creator>Sairampulipati</dc:creator>
      <dc:date>2021-03-10T17:50:05Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725222#M38317</link>
      <description>&lt;P&gt;In general SAS EG doesn't care about the data size, as it's SAS on the server that does the processing. Is your data a SAS data set file or is it in a database?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Your data set is pretty large and PROC FREQ can be memory-intensive. So depending what you're trying to do, you might consider looking at different methods like PROC SUMMARY or PROC SQL -- if they can support the computations you need. If your data is in a database though, then SAS will push the operation down to the database to limit the amount of data movement.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 17:57:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725222#M38317</guid>
      <dc:creator>ChrisHemedinger</dc:creator>
      <dc:date>2021-03-10T17:57:12Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725224#M38318</link>
      <description>&lt;P&gt;This is not a matter of Enterprise Guide, but of the workspace server that does all the work. The performance depends on the system/hardware of the server.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 18:00:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725224#M38318</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2021-03-10T18:00:09Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725228#M38319</link>
      <description>&lt;P&gt;Thanks Chris for your inputs. The SAS dataset am speaking is a file not a database. I am worried if it takes lot of time to run.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 18:14:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725228#M38319</guid>
      <dc:creator>Sairampulipati</dc:creator>
      <dc:date>2021-03-10T18:14:41Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725229#M38320</link>
      <description>Is there any recommendation of system/hardware specifications to run this kind of huge data?</description>
      <pubDate>Wed, 10 Mar 2021 18:16:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725229#M38320</guid>
      <dc:creator>Sairampulipati</dc:creator>
      <dc:date>2021-03-10T18:16:18Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725230#M38321</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/143413"&gt;@Sairampulipati&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Team,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have scenario where am expecting a dataset with size approximately equal to 5.5 TB. My concern is using SAS EG 7.1 or 8.2 can I able to analyze this 5.5 TB sized dataset? If yes, how much time will it take to run a simple proc freq code or so.. If there is any complex code how much time does it take run.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks in Advance.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards,&lt;/P&gt;
&lt;P&gt;Sairam&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;What ever you do, it may help to explicitly reduce the number of variables if they are not needed.&lt;/P&gt;
&lt;P&gt;If by simple you meant:&lt;/P&gt;
&lt;PRE&gt;Proc freq data=bigset;
run;&lt;/PRE&gt;
&lt;P&gt;is suspect you may run out of results window space if you have many levels of many varibles. You can reduce the memory use a bit with something like this to only the variables of interest/ needed for any analysis.&lt;/P&gt;
&lt;PRE&gt;Proc freq data=bigset (keep= var1 var2 var5 );
run;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 18:54:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725230#M38321</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2021-03-10T18:54:43Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725234#M38322</link>
      <description>&lt;P&gt;Say you've got average hardware that gives you a sustained data transfer rate of 200 MB/s.&lt;/P&gt;
&lt;P&gt;This lets you read 1 GB in 5 seconds.&lt;/P&gt;
&lt;P&gt;5.5 TB are roughly 5500 GB, so you would need up to 30,000 seconds to process it for one sequential pass (which is what FREQ needs).&lt;/P&gt;
&lt;P&gt;The cardinality of your variables will determine the amount of memory needed.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So it comes down to do some measurements on your existing hardware to get your transfer rates, and get to know your data.&lt;/P&gt;
&lt;P&gt;For approximations, you can start with a small subset to get a feel for the dimensions.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Mar 2021 18:46:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725234#M38322</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2021-03-10T18:46:45Z</dc:date>
    </item>
    <item>
      <title>Re: Can I able to run a dataset with a size of 5.5TB</title>
      <link>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725407#M38323</link>
      <description>If you have enough money , try SAS Viya  or  SAS SPDServer</description>
      <pubDate>Thu, 11 Mar 2021 11:45:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Enterprise-Guide/Can-I-able-to-run-a-dataset-with-a-size-of-5-5TB/m-p/725407#M38323</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2021-03-11T11:45:34Z</dc:date>
    </item>
  </channel>
</rss>

