<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Joining two very large tables in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661692#M197732</link>
    <description>&lt;P&gt;If you have a one-to-many or one-to-one relationship, consider sorting and a data step merge.&lt;/P&gt;
&lt;P&gt;2500 columns lets me suspect a design issue; depending on your needs, a longitudinal dataset structure might be better.&lt;/P&gt;
&lt;P&gt;Depending on your observation size, you might simply run into the limits of your storage infrastructure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please supply more details (SQL step code, observation size of your datasets, especially of that with just 50 variables).&lt;/P&gt;</description>
    <pubDate>Wed, 17 Jun 2020 11:41:29 GMT</pubDate>
    <dc:creator>Kurt_Bremser</dc:creator>
    <dc:date>2020-06-17T11:41:29Z</dc:date>
    <item>
      <title>Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661685#M197726</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have two tables, each with about 4-5 million rows of data.&lt;/P&gt;&lt;P&gt;Table A has about 50 columns and table B 2500.&lt;/P&gt;&lt;P&gt;The key is a combination of 4 columns.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Using SQL takes about a half hour.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any suggestions on how to speed things up would be greatly appreciated.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Eli&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 11:30:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661685#M197726</guid>
      <dc:creator>eliber</dc:creator>
      <dc:date>2020-06-17T11:30:49Z</dc:date>
    </item>
    <item>
      <title>Re: Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661689#M197729</link>
      <description>&lt;P&gt;Can you post your code please?&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 11:37:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661689#M197729</guid>
      <dc:creator>PeterClemmensen</dc:creator>
      <dc:date>2020-06-17T11:37:05Z</dc:date>
    </item>
    <item>
      <title>Re: Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661692#M197732</link>
      <description>&lt;P&gt;If you have a one-to-many or one-to-one relationship, consider sorting and a data step merge.&lt;/P&gt;
&lt;P&gt;2500 columns lets me suspect a design issue; depending on your needs, a longitudinal dataset structure might be better.&lt;/P&gt;
&lt;P&gt;Depending on your observation size, you might simply run into the limits of your storage infrastructure.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please supply more details (SQL step code, observation size of your datasets, especially of that with just 50 variables).&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 11:41:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661692#M197732</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2020-06-17T11:41:29Z</dc:date>
    </item>
    <item>
      <title>Re: Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661694#M197734</link>
      <description>Did you try MERGE ?&lt;BR /&gt;Or could try Hash Table if your have big memory .</description>
      <pubDate>Wed, 17 Jun 2020 11:43:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661694#M197734</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2020-06-17T11:43:45Z</dc:date>
    </item>
    <item>
      <title>Re: Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661697#M197737</link>
      <description>&lt;P&gt;There are things you can do, but half an hour is not that long time for a join like that.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your data is in a DBMS, take a look at the READBUFF libname for dataset option (assuming that you extract all the rows or most of them). If the output data also goes to a DBMS, the INSERTBUFF option can also help. If reading from and writing to SAS, the READBUFF and BUFFNO options may be of value.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If your data is sorted by the keys, try using a data step instead of SQL.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you are only accessing a minor portion of the tables, indexes may help.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 11:48:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661697#M197737</guid>
      <dc:creator>s_lassen</dc:creator>
      <dc:date>2020-06-17T11:48:45Z</dc:date>
    </item>
    <item>
      <title>Re: Joining two very large tables</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661721#M197746</link>
      <description>&lt;P&gt;Ho&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/334143"&gt;@eliber&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Try the technique found in this paper &lt;A title="Hash + Point = Key" href="https://www.lexjansen.com/nesug/nesug11/ld/ld01.pdf" target="_self"&gt;Hash + Point = Key&lt;/A&gt;. it will give you options for your large tables&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope this helps,&lt;/P&gt;
&lt;P&gt;Ahmed&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2020 12:39:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Joining-two-very-large-tables/m-p/661721#M197746</guid>
      <dc:creator>AhmedAl_Attar</dc:creator>
      <dc:date>2020-06-17T12:39:35Z</dc:date>
    </item>
  </channel>
</rss>

