<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Working on various columns all together in a huge datasets in proc sql in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229845#M268013</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am working on a huge data set with over 400 variables and around 99999 observations. It is a log detailing the usage of calls (local, international, duration, etc.), data, amount charged, etc. I have another data set with the list of churners and the churn months. This data is given for 6 months and over 3 years separately.&lt;/P&gt;&lt;P&gt;1. I am not able to join the 2 data sets on the key variable: Product&amp;nbsp;&lt;/P&gt;&lt;P&gt;It always gives me an insufficient memory error in the log. I tried running it on the first 40 obs and it still did not work.&lt;/P&gt;&lt;P&gt;2. I have to create a data set which has the details of calls for only 3 selected months, keeping everything else the same.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since loops are not allowed in proc sql, can anyone please help me work out how to go about it?&lt;/P&gt;</description>
    <pubDate>Wed, 14 Oct 2015 05:44:24 GMT</pubDate>
    <dc:creator>mansinarang12</dc:creator>
    <dc:date>2015-10-14T05:44:24Z</dc:date>
    <item>
      <title>Working on various columns all together in a huge datasets in proc sql</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229845#M268013</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am working on a huge data set with over 400 variables and around 99999 observations. It is a log detailing the usage of calls (local, international, duration, etc.), data, amount charged, etc. I have another data set with the list of churners and the churn months. This data is given for 6 months and over 3 years separately.&lt;/P&gt;&lt;P&gt;1. I am not able to join the 2 data sets on the key variable: Product&amp;nbsp;&lt;/P&gt;&lt;P&gt;It always gives me an insufficient memory error in the log. I tried running it on the first 40 obs and it still did not work.&lt;/P&gt;&lt;P&gt;2. I have to create a data set which has the details of calls for only 3 selected months, keeping everything else the same.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since loops are not allowed in proc sql, can anyone please help me work out how to go about it?&lt;/P&gt;</description>
      <pubDate>Wed, 14 Oct 2015 05:44:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229845#M268013</guid>
      <dc:creator>mansinarang12</dc:creator>
      <dc:date>2015-10-14T05:44:24Z</dc:date>
    </item>
    <item>
      <title>Re: Working on various columns all together in a huge datasets in proc sql</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229846#M268014</link>
      <description>Neither of those data sets sounds large enough to cause an out-of-memory error. How large (in GB) are the SAS data sets? Also, please post your code.</description>
      <pubDate>Wed, 14 Oct 2015 05:48:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229846#M268014</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2015-10-14T05:48:54Z</dc:date>
    </item>
    <item>
      <title>Re: Working on various columns all together in a huge datasets in proc sql</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229847#M268015</link>
      <description>&lt;P&gt;For joining the data sets I used this code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc sql;&lt;BR /&gt;select c.*, s.*&lt;BR /&gt;from cnl as c right join smb as s&lt;BR /&gt;on s.product_id = c.product_id;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The size of smb is 1.2 GB and cnl is a few KB.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For the second part, where I need usage for only 3 selected months, I do not know how to go about it. Please help.&lt;/P&gt;</description>
      <pubDate>Wed, 14 Oct 2015 05:57:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229847#M268015</guid>
      <dc:creator>mansinarang12</dc:creator>
      <dc:date>2015-10-14T05:57:59Z</dc:date>
    </item>
    <item>
      <title>Re: Working on various columns all together in a huge datasets in proc sql</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229852#M268016</link>
      <description>&lt;P&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/58017"&gt;@mansinarang12&lt;/a&gt;&lt;/P&gt;
&lt;P&gt;Not sure why you're running out of memory. As an alternative to the SQL you've posted, a hash lookup table should perform better. Condition for the code below (and for your SQL) to work properly: PRODUCT_ID is the primary key in both tables.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;
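data want(drop=_rc);
  if _n_=1 then
    do;
      /* Define the cnl variables in the PDV without reading any rows. */
      if 0 then set cnl;
      /* Load the small table cnl into an in-memory hash keyed on product_id. */
      dcl hash h (dataset:'cnl');
      h.defineKey('product_id');
      h.defineData(all:'y');
      h.defineDone();
    end;
  /* Reset all variables so values from a previous match do not carry over. */
  call missing(of _all_);

  set smb;
  /* Look up the current product_id; a nonzero _rc means no match in cnl. */
  _rc=h.find();
run;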
&lt;/CODE&gt;&lt;/PRE&gt;
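&lt;P&gt;To check the primary key condition first, here is a minimal sketch that compares the row count with the number of distinct keys in each table. The output column names table_name, n_rows and n_keys are only illustrative. If the two counts differ for a table, PRODUCT_ID is not unique there. Keep in mind that by default the hash object keeps only the first row per key, whereas the SQL join returns every matching combination, so the two approaches would no longer give the same result.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;
proc sql;
  /* One result row per table: total rows vs. distinct key values. */
  /* count(distinct ...) ignores missing values of product_id.     */
  select 'cnl' as table_name,
         count(*) as n_rows,
         count(distinct product_id) as n_keys
    from cnl
  union all
  select 'smb',
         count(*),
         count(distinct product_id)
    from smb;
quit;
&lt;/CODE&gt;&lt;/PRE&gt;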
</description>
      <pubDate>Wed, 14 Oct 2015 08:47:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229852#M268016</guid>
      <dc:creator>Patrick</dc:creator>
      <dc:date>2015-10-14T08:47:56Z</dc:date>
    </item>
    <item>
      <title>Re: Working on various columns all together in a huge datasets in proc sql</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229904#M268017</link>
      <description>&lt;P&gt;First thing with any date-related topic: are your dates SAS date-valued variables? That makes anything related to dates much easier in 99.9% of cases.&lt;/P&gt;
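&lt;P&gt;If they are, picking out three months could look something like the sketch below. This is only an illustration under assumptions: usage_date is a hypothetical name for a numeric SAS date variable in smb, smb_3months is a made-up output name, and the three months listed are placeholders. If the months are instead spread across separate columns, a different approach is needed.&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;
/* Assumption: usage_date is a numeric SAS date variable in smb. */
data smb_3months;
  set smb;
  /* Format each date as yyyymm and keep only the chosen months. */
  where put(usage_date, yymmn6.) in ('201507', '201508', '201509');
run;
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;All other variables are kept as they are; only the rows are filtered.&lt;/P&gt;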
</description>
      <pubDate>Wed, 14 Oct 2015 14:23:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Working-on-various-columns-all-together-in-a-huge-datasets-in/m-p/229904#M268017</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2015-10-14T14:23:49Z</dc:date>
    </item>
  </channel>
</rss>

