BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
GN0001
Barite | Level 11

Hello team,

I am reviewing a log for the program I ran. I can see this:

Tagsort reads each observation of the input data set twice.

 

Is this a mechanism in SAS that tagsort reads each observation two times?

 

Respectfully,

blue blue

 

Blue Blue
1 ACCEPTED SOLUTION

Accepted Solutions
ballardw
Super User

That is normal. The procedure gets the BY variables and the record identifiers, sorts those, then matches to the full data set by record identifier. There may be a time penalty to reduce the disk usage.

You are trading off time for disk space as normally Proc Sort makes two additional copies of the data, roughly, before overwriting the source set (if not creating a different output data set) and removing the temporary data.

View solution in original post

2 REPLIES 2
Astounding
PROC Star

Yes.

 

Do you have access to documentation about PROC SORT?

ballardw
Super User

That is normal. The procedure gets the BY variables and the record identifiers, sorts those, then matches to the full data set by record identifier. There may be a time penalty to reduce the disk usage.

You are trading off time for disk space as normally Proc Sort makes two additional copies of the data, roughly, before overwriting the source set (if not creating a different output data set) and removing the temporary data.

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 595 views
  • 2 likes
  • 3 in conversation