<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Extracting subset of observations from merging large datasets in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Extracting-subset-of-observations-from-merging-large-datasets/m-p/312465#M270753</link>
    <description>&lt;P&gt;Efficiency is dependent on &lt;STRONG&gt;input&lt;/STRONG&gt; (that you described), on the &lt;STRONG&gt;target&lt;/STRONG&gt; (what do you wand to do / analyze / report)&lt;/P&gt;
&lt;P&gt;and on &lt;STRONG&gt;resources&lt;/STRONG&gt; you have (like disk space, memory available);&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another issue - is your &lt;STRONG&gt;data&lt;/STRONG&gt; already in &lt;STRONG&gt;SAS tables&lt;/STRONG&gt; or in &lt;STRONG&gt;external data&lt;/STRONG&gt; (CSV or text or DataBase and if DB - what kind)&lt;/P&gt;
&lt;P&gt;All those have effect on programming code.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Even after getting information, as above, there maybe several ways to do the work efficiently.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If possible, describe what variables are in each file kind.&lt;/P&gt;
&lt;P&gt;Meanwhile I understand that you have 2 kinds of data:&lt;/P&gt;
&lt;P&gt;1) a table of firms in each state &amp;nbsp;(variables:&amp;nbsp;&lt;SPAN&gt;FIRMID, STATENAME)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;2) employment data in (11 years X 4 quarters X number of states) files.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; What are the names of those files - are they in a common format ?&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 17 Nov 2016 21:13:51 GMT</pubDate>
    <dc:creator>Shmuel</dc:creator>
    <dc:date>2016-11-17T21:13:51Z</dc:date>
    <item>
      <title>Extracting subset of observations from merging large datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Extracting-subset-of-observations-from-merging-large-datasets/m-p/312438#M270752</link>
      <description>&lt;P style="background: white;"&gt;&lt;SPAN style="color: #333333;"&gt;Hello there:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P style="background: white;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="background: white;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="background: white; font-variant-ligatures: normal; font-variant-caps: normal; orphans: 2; text-align: start; widows: 2; -webkit-text-stroke-width: 0px; min-height: 8pt; word-spacing: 0px;"&gt;&lt;SPAN style="color: #333333;"&gt;I have FILE A which consists of 500,000 firms in a country. For each firm, I have FIRMID and STATENAME. I want to get annual employment data (4&lt;SUP&gt;th&lt;/SUP&gt; quarter) for these firms from 2000-2010.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P style="background: white;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="background: white;"&gt;&lt;SPAN style="color: #333333;"&gt;The employment data is available as individual files for each state and each quarter from 2000 to 2010. There are several million firms overall across all the states. What is the most efficient way of extracting the 4&lt;SUP&gt;th&lt;/SUP&gt; quarter employment data each year from these files for just the 500,000 firms in FILE A?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P style="background: white;"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P style="background: white;"&gt;&lt;SPAN style="color: #333333;"&gt;Any help with the code would be much appreciated.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P style="background: white;"&gt;&lt;SPAN style="color: #333333;"&gt;Thanks&lt;/SPAN&gt;&lt;/P&gt;
&lt;P style="background: white;"&gt;Dana&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2016 19:50:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Extracting-subset-of-observations-from-merging-large-datasets/m-p/312438#M270752</guid>
      <dc:creator>dshills</dc:creator>
      <dc:date>2016-11-17T19:50:18Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting subset of observations from merging large datasets</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Extracting-subset-of-observations-from-merging-large-datasets/m-p/312465#M270753</link>
      <description>&lt;P&gt;Efficiency is dependent on &lt;STRONG&gt;input&lt;/STRONG&gt; (that you described), on the &lt;STRONG&gt;target&lt;/STRONG&gt; (what do you wand to do / analyze / report)&lt;/P&gt;
&lt;P&gt;and on &lt;STRONG&gt;resources&lt;/STRONG&gt; you have (like disk space, memory available);&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Another issue - is your &lt;STRONG&gt;data&lt;/STRONG&gt; already in &lt;STRONG&gt;SAS tables&lt;/STRONG&gt; or in &lt;STRONG&gt;external data&lt;/STRONG&gt; (CSV or text or DataBase and if DB - what kind)&lt;/P&gt;
&lt;P&gt;All those have effect on programming code.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Even after getting information, as above, there maybe several ways to do the work efficiently.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If possible, describe what variables are in each file kind.&lt;/P&gt;
&lt;P&gt;Meanwhile I understand that you have 2 kinds of data:&lt;/P&gt;
&lt;P&gt;1) a table of firms in each state &amp;nbsp;(variables:&amp;nbsp;&lt;SPAN&gt;FIRMID, STATENAME)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;2) employment data in (11 years X 4 quarters X number of states) files.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; What are the names of those files - are they in a common format ?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 17 Nov 2016 21:13:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Extracting-subset-of-observations-from-merging-large-datasets/m-p/312465#M270753</guid>
      <dc:creator>Shmuel</dc:creator>
      <dc:date>2016-11-17T21:13:51Z</dc:date>
    </item>
  </channel>
</rss>

