<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Sorting large data in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788783#M252292</link>
    <description>&lt;P&gt;Where is the dataset tmp coming from? If it is already in the right order, you don't need to sort it again.&lt;/P&gt;
&lt;P&gt;Have you tried your program without the sort, and if so, what does your SAS log report?&lt;/P&gt;</description>
    <pubDate>Fri, 07 Jan 2022 03:42:19 GMT</pubDate>
    <dc:creator>SASKiwi</dc:creator>
    <dc:date>2022-01-07T03:42:19Z</dc:date>
    <item>
      <title>Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788781#M252290</link>
      <description>The data are not sorted. I am running the code below:&lt;BR /&gt;Proc sort data=tmp; by id name var;&lt;BR /&gt;&lt;BR /&gt;Data tmp1;&lt;BR /&gt;Set tmp;&lt;BR /&gt;By id name var;&lt;BR /&gt;If first.name and last.var;&lt;BR /&gt;Run;&lt;BR /&gt;&lt;BR /&gt;How can I do this without sorting?&lt;BR /&gt;My dataset is very large, and the sort alone takes over an hour. Any help?&lt;BR /&gt;Thank you</description>
      <pubDate>Fri, 07 Jan 2022 03:44:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788781#M252290</guid>
      <dc:creator>Emma2021</dc:creator>
      <dc:date>2022-01-07T03:44:35Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788783#M252292</link>
      <description>&lt;P&gt;Where is the dataset tmp coming from? If it is already in the right order, you don't need to sort it again.&lt;/P&gt;
&lt;P&gt;Have you tried your program without the sort, and if so, what does your SAS log report?&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 03:42:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788783#M252292</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2022-01-07T03:42:19Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788784#M252293</link>
      <description>They are not sorted by those 3 key variables.</description>
      <pubDate>Fri, 07 Jan 2022 03:43:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788784#M252293</guid>
      <dc:creator>Emma2021</dc:creator>
      <dc:date>2022-01-07T03:43:53Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788805#M252308</link>
      <description>&lt;P&gt;We need more information, so please post the PROC CONTENTS output for your dataset "tmp". It will show the size of the dataset along with information about the variables it contains; TAGSORT might be an option. Also, please post a few rows of the dataset and the expected result, as always in the form of a working data step. Another approach may be possible.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 06:18:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788805#M252308</guid>
      <dc:creator>andreas_lds</dc:creator>
      <dc:date>2022-01-07T06:18:07Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788809#M252311</link>
      <description>&lt;P&gt;Please post the complete output of PROC CONTENTS for your dataset.&lt;/P&gt;
&lt;P&gt;In which environment do you run this code (single machine or client-server)?&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 08:29:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788809#M252311</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2022-01-07T08:29:47Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788899#M252357</link>
      <description>&lt;P&gt;Have you tried the TAGSORT option? It reduces the size of the temporary utility files used during the sort. If part of the run time goes to moving lots of data in and out of those utility files, it may reduce execution time. It may not.&lt;/P&gt;
&lt;PRE&gt;Proc sort data=tmp tagsort; 
   by id name var;
run;
&lt;/PRE&gt;
</description>
      <pubDate>Fri, 07 Jan 2022 15:50:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788899#M252357</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2022-01-07T15:50:17Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788901#M252358</link>
      <description>There are only 4 variables. All are character strings of length about 200, except var, which holds numeric integer data.</description>
      <pubDate>Fri, 07 Jan 2022 15:51:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788901#M252358</guid>
      <dc:creator>Emma2021</dc:creator>
      <dc:date>2022-01-07T15:51:26Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788907#M252361</link>
      <description>&lt;P&gt;I asked for the complete output of PROC CONTENTS for a purpose, not because I am extremely bored.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 16:12:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/788907#M252361</guid>
      <dc:creator>Kurt_Bremser</dc:creator>
      <dc:date>2022-01-07T16:12:48Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/789268#M252534</link>
      <description>&lt;P&gt;First, make sure your memory options (MEMSIZE, SORTSIZE) are set to use as much of your available memory as possible.&lt;/P&gt;
&lt;P&gt;But at some point you need to sort that data.&lt;/P&gt;
&lt;P&gt;You could omit storing a temporary data set (probably not the biggest part of the processing) by using a single SQL statement:&lt;/P&gt;
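&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Keeps every id/name/var combination that occurs exactly once.
   Caution: the data step filter (first.name and last.var) keeps at most
   one row per id/name, so the two results can differ. */
proc sql;
create table tmp1 as
select id, name, var
from tmp
group by id, name, var
having count(*) = 1;
quit;&lt;/CODE&gt;&lt;/PRE&gt;</description>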
      <pubDate>Mon, 10 Jan 2022 15:37:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/789268#M252534</guid>
      <dc:creator>LinusH</dc:creator>
      <dc:date>2022-01-10T15:37:31Z</dc:date>
    </item>
    <item>
      <title>Re: Sorting large data</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/789330#M252567</link>
      <description>&lt;P&gt;Long character variables, such as 200-character strings, take up a lot of space and result in large datasets. Using the SAS option COMPRESS=YES might reduce your dataset sizes and speed up your processing.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Jan 2022 19:35:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Sorting-large-data/m-p/789330#M252567</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2022-01-10T19:35:44Z</dc:date>
    </item>
  </channel>
</rss>

