06-28-2016 07:06 AM
We have 2 datasets. On these 2 datasets we are performing some merge operations.
What will be impact if
1. input data sets are in Sorted order
2. Input data sets are not in sorted order
Thanks In advance
06-28-2016 08:16 AM
For a DATA step merge, sorting before the merge is mandatory. Exception: if the datasets are stored in SPDS, then the sort is done automatically (this does not reduce the overall resource consumption, you just don't need to write and execute the SORT step on your own)
If you use SAS SQL, no sorting is necessary, as SQL does this on the fly. But there is a big caution: with large datasets, the utilty file of the SQL step can grow out of proportion, and the random accesses to this file will be much less performant than the combined SORT/DATA steps. Perfomance penalties of orders of magnitude have been observed.