<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Accessing big data using SAS PC in SAS Data Management</title>
    <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458486#M14193</link>
    <description>&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/159"&gt;@Tom&lt;/a&gt;&amp;nbsp;I decided to use a macro to see how it works.&amp;nbsp;Referring to the different librefs below is my code.&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;I am not sure if it fits in memory, but it would be worth trying.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;%macro&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; allfiles(varlist=);&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;libname2.orig1&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname&lt;/FONT&gt;2.orig2&amp;amp;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname&lt;/FONT&gt;3.orig3&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname4&lt;/FONT&gt;.orig4&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname5&lt;/FONT&gt;.orig5&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname5&lt;/FONT&gt;.orig6&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;%mend&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;　&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;data&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; out_all;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;set&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; %all&lt;STRONG&gt;&lt;I&gt;files&lt;/I&gt;&lt;/STRONG&gt;(varlist=&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;%str&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;((&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;keep&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;=id&amp;nbsp;var1&amp;nbsp;var2 type);&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 29 Apr 2018 21:58:50 GMT</pubDate>
    <dc:creator>dr2014</dc:creator>
    <dc:date>2018-04-29T21:58:50Z</dc:date>
    <item>
      <title>Accessing big data uisng SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458432#M14181</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;
&lt;P&gt;I am working with big data the size of around 158,000,000 KB . I am using SAS EG on PC. Its taking a long time to bring in the data. I will be using 'views' to work with the data and also limiting the data to few required variables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;What are some of the things I should take into account while working with 'views' and/or other ways I could work with the data faster? Your inputs will be really helpful. Please let me know as soon as possible. Thanks!&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 11:27:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458432#M14181</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T11:27:37Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data uisng SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458445#M14182</link>
      <description>&lt;P&gt;Define "bring in the data". Do you mean reading a SAS dataset for analysis, or importing the data from external sources?&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 15:18:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458445#M14182</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2018-04-29T15:18:16Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458453#M14183</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/11562"&gt;@Kurt_Bremser&lt;/a&gt;&amp;nbsp;sure. I am working with a sas dataset the size of 186 GB. The 'view' for the sas dataset 'have' gets created instantly but the run time for the next datastep doesn' t seem to&amp;nbsp;end. Below is the code I am running presently.&amp;nbsp;I selected both the data steps to run and its about 20&amp;nbsp; minutes now and&amp;nbsp;its&amp;nbsp;still running.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;data&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; want(&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;keep&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;=id&amp;nbsp;var1 var2 var3 var4 var5 var6 var7 var8 var9 var10 var11&lt;FONT face="Courier New" size="3"&gt;) /&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;view&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;=want;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;set&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; libref.have;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;LI-WRAPPER&gt;&lt;/LI-WRAPPER&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;data&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; want_2;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;set&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; want;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;if&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; id=&lt;/FONT&gt;&lt;STRONG&gt;&lt;FONT color="#008080" face="Courier New" size="3"&gt;.&lt;/FONT&gt;&lt;/STRONG&gt; &lt;FONT color="#0000ff" face="Courier New" size="3"&gt;then&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; id2=&lt;/FONT&gt;&lt;STRONG&gt;&lt;FONT color="#008080" face="Courier New" size="3"&gt;0&lt;/FONT&gt;&lt;/STRONG&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;else&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; id2=&lt;/FONT&gt;&lt;STRONG&gt;&lt;FONT color="#008080" face="Courier New" size="3"&gt;1&lt;/FONT&gt;&lt;/STRONG&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 16:04:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458453#M14183</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T16:04:23Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458455#M14184</link>
      <description>&lt;P&gt;Don't make a copy of the large data if you can avoid it.&amp;nbsp; What you have posted as two steps could all be done in the first step.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data want /view=want;
  set libref.have(keep=id var1-var11) ;
  id2 = not missing(id);
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;What are you doing to DO with the data?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 16:13:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458455#M14184</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2018-04-29T16:13:34Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458463#M14185</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/159"&gt;@Tom&lt;/a&gt; thanks for the advice. I actually need to work&amp;nbsp; a lot more with these views. Doing a proc sort on the view is also time consuming. Is that normal?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For e.g: for the view 'want'&lt;/P&gt;
&lt;P&gt;proc sort data=want out=want_sort; by id; run;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 16:47:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458463#M14185</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T16:47:54Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458465#M14186</link>
      <description>&lt;P&gt;You are again making a copy of the data.&lt;/P&gt;
&lt;P&gt;If you need the data sorted in a way it is not already sorted then you might look at using an INDEX instead.&lt;/P&gt;
&lt;P&gt;I am not sure that an index can be used with a view however.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Do you really need to sort the data?&lt;/P&gt;
&lt;P&gt;What are you ultimately trying to do?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 16:50:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458465#M14186</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2018-04-29T16:50:36Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458469#M14187</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/159"&gt;@Tom&lt;/a&gt;&amp;nbsp;I get your point but I have&amp;nbsp;a larger task at hand. The system is not built to handle big data and I am thinking of ways to process it faster. Here is a detailed list of my task. Please advice me on what's the best way to go about it:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;(1) I have 5 large sas datasets orig1 to orig5 around 186 gb size each&amp;nbsp;. I just need a few variables from each of these datasets with non-missing ids for which I would like to create views called orig_1v to orig_5v;&lt;/P&gt;
&lt;P&gt;So the code will be. Creating a view allows me to get what I need instantly.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;data orig_1v (keep=id start_date2 end_date type)/view=orig_1v;&lt;/P&gt;
&lt;P&gt;set orig1;&lt;/P&gt;
&lt;P&gt;where id ne .;&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Below is the sample of each of the orig_1v to Orig5v views &lt;U&gt;(please ignore the last 2 empty columns)&lt;/U&gt;&lt;/P&gt;
&lt;TABLE width="291"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="204"&gt;
&lt;TABLE width="193"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="44"&gt;
&lt;P&gt;id&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="57"&gt;
&lt;P&gt;start_date2&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="49"&gt;
&lt;P&gt;end_date&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="44"&gt;
&lt;P&gt;type&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;/TD&gt;
&lt;TD width="44"&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="44"&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;(2) I need only certain ids from orig_1v to orig_5v which are present in the dataset ‘required’. Below is the sample of the ‘required’ sas dataset&lt;/P&gt;
&lt;TABLE width="128"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD width="64"&gt;
&lt;P&gt;id&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="64"&gt;
&lt;P&gt;start_date&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD width="64"&gt;
&lt;P&gt;001&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="64"&gt;
&lt;P&gt;1/1/2001&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD width="64"&gt;
&lt;P&gt;002&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="64"&gt;
&lt;P&gt;1/2/2002&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD width="64"&gt;
&lt;P&gt;003&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="64"&gt;
&lt;P&gt;1/2/3002&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD width="64"&gt;
&lt;P&gt;004&lt;/P&gt;
&lt;/TD&gt;
&lt;TD width="64"&gt;
&lt;P&gt;1/3/2002&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;(3) I can either stack the views first (which I don't know if it’s a good idea to do that):&lt;/P&gt;
&lt;P&gt;data orig_vall;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;Set orig_1v&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; orig_2v&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &amp;nbsp;orig_3v&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; orig_4v&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; orig_5v;&lt;/P&gt;
&lt;P&gt;run;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;(4) Then I can do an inner join to create another view:&lt;/P&gt;
&lt;P&gt;proc sql;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;create view as want&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Select a*.,b.start_date&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; from orig_all as a&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Inner join required as b&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on a.id=b.id;&lt;/P&gt;
&lt;P&gt;quit;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Once I have the view ‘want’ I have to create derived variables and flag them based on conditions related to the dates:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;start_date, start_date2 and end_date. Could I do that using the view 'want'?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is this a good approach? Please let me know. Thanks.&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 18:14:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458469#M14187</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T18:14:50Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458470#M14188</link>
      <description>&lt;P&gt;As the first step, create new&amp;nbsp;&lt;EM&gt;datasets&lt;/EM&gt; (not views) that only contain the needed variables. These will sort a lot faster.&lt;/P&gt;
&lt;P&gt;The first step will of course take lots of time, but a sequential read will be the fastest you can do.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Take a look at the overall data rate of your storage. With data this size, you need something that can deliver 500 MB/s or faster.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 19:03:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458470#M14188</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2018-04-29T19:03:42Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458480#M14189</link>
      <description>&lt;P&gt;If you have SAS on a remote server available to you it would be far quicker there.&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 21:17:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458480#M14189</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2018-04-29T21:17:59Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458482#M14190</link>
      <description>&lt;P&gt;Since (2) seems to be the part that will subset the data the most do that first.&lt;/P&gt;
&lt;P&gt;How large is this list? Can you fit it in a macro variable?&lt;/P&gt;
&lt;P&gt;Can you fit it in memory?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 21:34:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458482#M14190</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2018-04-29T21:34:41Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458483#M14191</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/11562"&gt;@Kurt_Bremser&lt;/a&gt; thanks for your suggestion. I will do that. How is it possible to estimate the overall data rate of storage? I would like to see that and provide information to our technical analyst in case we need any changes. Please let me know.&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 21:36:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458483#M14191</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T21:36:15Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458484#M14192</link>
      <description>&lt;P&gt;It is on a remote server but its not helping much with the run time.&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 21:37:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458484#M14192</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T21:37:29Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458486#M14193</link>
      <description>&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/159"&gt;@Tom&lt;/a&gt;&amp;nbsp;I decided to use a macro to see how it works.&amp;nbsp;Referring to the different librefs below is my code.&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;I am not sure if it fits in memory, but it would be worth trying.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;%macro&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; allfiles(varlist=);&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;libname2.orig1&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname&lt;/FONT&gt;2.orig2&amp;amp;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname&lt;/FONT&gt;3.orig3&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname4&lt;/FONT&gt;.orig4&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname5&lt;/FONT&gt;.orig5&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;nbsp;&lt;FONT face="Courier New"&gt;libname5&lt;/FONT&gt;.orig6&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;&amp;amp;varlist&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;%mend&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;　&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;data&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; out_all;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;set&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt; %all&lt;STRONG&gt;&lt;I&gt;files&lt;/I&gt;&lt;/STRONG&gt;(varlist=&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;%str&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;((&lt;/FONT&gt;&lt;FONT color="#0000ff" face="Courier New" size="3"&gt;keep&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;=id&amp;nbsp;var1&amp;nbsp;var2 type);&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#000080" face="Courier New" size="3"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;&lt;/FONT&gt;&lt;FONT face="Courier New" size="3"&gt;;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 29 Apr 2018 21:58:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458486#M14193</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-29T21:58:50Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458498#M14194</link>
      <description>&lt;P&gt;I suggest making your post subject a bit clearer then.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Apr 2018 00:33:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458498#M14194</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2018-04-30T00:33:30Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458500#M14195</link>
      <description>&lt;P&gt;Macro code is not going to change the SAS code that is run or how SAS runs it. It just makes it easier for you type it.&amp;nbsp; For just 5 or 6 datasets if is just is probably easier to just use copy/paste to replicate the code rather than complicate your program with macro code.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Most SAS jobs are I/O bound and it sounds like that is your problem. Run PROC CONTENTS on your input datasets and check some things. Make sure they were made with the same operating system and version of SAS.&amp;nbsp; If SAS has to use CEDA to read the files it will slow it down a lot.&amp;nbsp; Check if they are already sorted by the ID variable(or at least if SAS thinks so).&amp;nbsp; Eliminating the need to sort is a hugh deal.&amp;nbsp; Check if they are using SAS compression.&amp;nbsp; Many SAS datasets can get 60-80 reduction in disk size just by using the COMPRESS=BINARY option when creating them.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Try using the SPDE engine to store your new subsets.&amp;nbsp; That can really reduce the disk space needed to store the data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The main thing is subset as early as possible.&amp;nbsp; &amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your list of ID values is small then store them in a macro variable.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc sql noprint;
 select distinct id into :idlist separated by ' '
 from relation
 ;
quit;

%let varlist=id var1 vara varb varc ;
proc sort data=libname1.orig1(keep=&amp;amp;varlist) out=subset1 (compress=binary);
  by id;
  where id in (&amp;amp;idlist);
run;
...
data combine/view=combine;
  set subset1 subset2 subset3 .... ;
  by id;
run;

data final (compress=binary);
  merge relation combine ;
  by id;
run;    &lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Mon, 30 Apr 2018 01:06:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458500#M14195</guid>
      <dc:creator>Tom</dc:creator>
      <dc:date>2018-04-30T01:06:52Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458516#M14196</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/44608"&gt;@dr2014&lt;/a&gt; wrote:&lt;BR /&gt;
&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/11562"&gt;@Kurt_Bremser&lt;/a&gt; thanks for your suggestion. I will do that. How is it possible to estimate the overall data rate of storage? I would like to see that and provide information to our technical analyst in case we need any changes. Please let me know.&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Just do a&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;options fullstimer;

data _null_;
set yourlib.yourdata;
run;&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;and look at the log. Divide the size by the time, and you have your sustained read throughput.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you pull data from a remote DBMS, the network will be your bottleneck. In this case, unload as much of the initial "slicing" to the database, with explicit pass-through, if needed.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Apr 2018 04:40:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458516#M14196</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2018-04-30T04:40:34Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458685#M14205</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/159"&gt;@Tom&lt;/a&gt;&amp;nbsp;Thanks for all your suggestions. I will definitely be implementing them.&amp;nbsp;I have about 20000 ids so it will be hard to put them in a macro. It would have certainly helped to subset to those ids in the first step itself.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Apr 2018 15:18:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458685#M14205</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-30T15:18:30Z</dc:date>
    </item>
    <item>
      <title>Re: Accessing big data using SAS PC</title>
      <link>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458688#M14206</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/11562"&gt;@Kurt_Bremser&lt;/a&gt;&amp;nbsp;I will check use your code to check on the time. Thanks for all your advice. Basically, more than 1 person has contributed on the solution. Thanks much!&lt;/P&gt;</description>
      <pubDate>Mon, 30 Apr 2018 15:22:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Management/Accessing-big-data-uisng-SAS-PC/m-p/458688#M14206</guid>
      <dc:creator>dr2014</dc:creator>
      <dc:date>2018-04-30T15:22:20Z</dc:date>
    </item>
  </channel>
</rss>

