Hello team,
I am reviewing a log for the program I ran. I can see this:
Tagsort reads each observation of the input data set twice.
Is it expected behavior in SAS that TAGSORT reads each observation twice?
Respectfully,
blue blue
That is normal. With TAGSORT, the procedure reads the BY variables and the record identifiers, sorts only those, then uses the record identifiers to retrieve the full observations from the data set, which is the second read. You may pay a time penalty in exchange for the reduced disk usage.
You are trading time for disk space: normally PROC SORT makes roughly two additional copies of the data before overwriting the source data set (if you are not creating a different output data set) and removing the temporary data.
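To make the two reads concrete, here is a minimal conceptual sketch in Python. It is not SAS code and not how PROC SORT is implemented internally; the function name `tag_sort` and the sample rows are made up purely for illustration. It only mirrors the idea described above: collect small "tags" (BY value plus record number), sort the tags, then go back to the full records by record number.

```python
def tag_sort(records, key):
    """Conceptual tag sort: sort small (key, record number) tags, then refetch records."""
    # Pass 1: read every record once, keeping only the sort key and its record number.
    tags = [(key(rec), idx) for idx, rec in enumerate(records)]

    # Sort only the tags; the full records are not copied or moved at this point.
    tags.sort()

    # Pass 2: read each record a second time, fetching it by record number
    # in the order given by the sorted tags.
    return [records[idx] for _, idx in tags]


# Hypothetical sample data standing in for a data set with a BY variable "id".
rows = [
    {"id": 3, "name": "c"},
    {"id": 1, "name": "a"},
    {"id": 2, "name": "b"},
]

sorted_rows = tag_sort(rows, key=lambda r: r["id"])
print([r["id"] for r in sorted_rows])
```

The tags are much smaller than the full records, which is where the space saving comes from; the second pass over the records is where the extra time goes.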
Yes.
Do you have access to documentation about PROC SORT?