BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Mike_B
Obsidian | Level 7

I have a data set from which a couple sets, identical to the source set, are derived. The original code sorted each derived set one at a time. To speed up the program, I would like to change it to just sort the source data set. But I notice when I do this that the source set's metadata shows "SORTED BY whatever", but the derived sets do not show that they are sorted by anything. 

 

My question is, does it matter? Is there any practical difference between 2 sets with the same exact record order if one's metadata indicates how it is sorted and the other does not?

 

Thanks. I know kind of a weird question.

 

data source;
infile datalines dlm=',';
length a $1. b 3.;
input a $ b;
datalines;
a,1
b,2
c,3
d,4
e,5
;
proc sort;
by a;
run;

data derived1 derived2;
set source;
by a;
run;
1 ACCEPTED SOLUTION

Accepted Solutions
FreelanceReinh
Jade | Level 19

Hi @Mike_B,

 

You can use the SORTEDBY= dataset option to attach that metadata to the derived datasets (without sorting them):

data derived(sortedby=a) derived2(sortedby=a);
set source;
by a;
run;

View solution in original post

5 REPLIES 5
Reeza
Super User
Only later on, if you try and do another sort, SAS will attempt to sort it again, whereas if you have a sorted by tag then it knows it's sorted and will not sort it again.

FreelanceReinh
Jade | Level 19

Hi @Mike_B,

 

You can use the SORTEDBY= dataset option to attach that metadata to the derived datasets (without sorting them):

data derived(sortedby=a) derived2(sortedby=a);
set source;
by a;
run;
Mike_B
Obsidian | Level 7

Thanks. Found this article that elaborates on the concept. So I decided to use the sortedby data set option + the presorted proc sort option.

 

data derived1(sortedby=a) derived2(sortedby=a);
set source;
by a;
run;

proc sort data=derived1 presorted;
by a;
proc sort data=derived2 presorted;
by a;
run;
PhilC
Rhodochrosite | Level 12

For completeness there is this article on "Sorted Datasets" and the "The Sort Indicator".  http://support.sas.com/documentation/cdl/en/lrcon/62955/HTML/default/viewer.htm#a000766829.htm#a0031...

 

ChrisNZ
Tourmaline | Level 20

> My question is, does it matter?
This metadata is hugely important if you want to leverage one of SAS's strengths in terms or storage: it keeps data sets sorted.

 

Note that there are 2 sorted flags. The VALIDATED sort flag is the one that matters as SAS can ignore the flag that you set yourself in some cases. See here.

 

The last step when creating a permanent table should be a proc sort (with option presorted if applicable) so the VALIDATED flag is set. The output data set should also set other options: index= if needed, compression= to choose the best method for that particular data, and the write= alter= options, to avoiding anyone overwriting the data set by accident. The password can be well-known if needed. The goal is to avoid accidental changes. Other useful options can be set too to improve performance.

 

For example something like:

proc sort data=SUMS 
          out =PRODLIB.FINAL( index   =(CUSTOMER_ID) 
                              compress=char 
                              write   =ProdData 
                              alter   =ProdData
                              label   ="Created %sysfunc(datetime(),datetime20.) by &sysuserid"
                              bufsize =32k
                              bufno   =25
                              pointobs=no)
          presorted;
  by DATE; 
run;

 

 

 

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 5 replies
  • 1231 views
  • 8 likes
  • 5 in conversation