BookmarkSubscribeRSS Feed
yudhishtirb
Calcite | Level 5

Hello Team,

 

Greetings

 

We have 2 datasets. On these 2 datasets we are performing some merge operations.

 

What will be impact if

1. input data sets are in Sorted order

2. Input data sets are not in sorted order

 

Thanks In advance

3 REPLIES 3
Kurt_Bremser
Super User

For a DATA step merge, sorting before the merge is mandatory. Exception: if the datasets are stored in SPDS, then the sort is done automatically (this does not reduce the overall resource consumption, you just don't need to write and execute the SORT step on your own)

If you use SAS SQL, no sorting is necessary, as SQL does this on the fly. But there is a big caution: with large datasets, the utilty file of the SQL step can grow out of proportion, and the random accesses to this file will be much less performant than the combined SORT/DATA steps. Perfomance penalties of orders of magnitude have been observed.

LinusH
Tourmaline | Level 20
Addition to @Kurt_Bremser exception list: implicit sort is also being done for SPDE and most SAS/ACCESS to RDBMS engines.
Data never sleeps
LinusH
Tourmaline | Level 20
Addition to @Kurt_Bremser exception list: implicit sort is also being done for SPDE and most SAS/ACCESS to RDBMS engines.
Data never sleeps

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

Creating Custom Steps in SAS Studio

Check out this tutorial series to learn how to build your own steps in SAS Studio.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 3 replies
  • 1577 views
  • 1 like
  • 3 in conversation