Hello Team,
Greetings
We have 2 datasets. On these 2 datasets we are performing some merge operations.
What will be impact if
1. input data sets are in Sorted order
2. Input data sets are not in sorted order
Thanks In advance
For a DATA step merge, sorting before the merge is mandatory. Exception: if the datasets are stored in SPDS, then the sort is done automatically (this does not reduce the overall resource consumption, you just don't need to write and execute the SORT step on your own)
If you use SAS SQL, no sorting is necessary, as SQL does this on the fly. But there is a big caution: with large datasets, the utilty file of the SQL step can grow out of proportion, and the random accesses to this file will be much less performant than the combined SORT/DATA steps. Perfomance penalties of orders of magnitude have been observed.
Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!
Check out this tutorial series to learn how to build your own steps in SAS Studio.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.