@eiger wrote:
The library is a folder on a remote drive. The file corpxin.items_202001 is a .sas7bdat dataset.
Yes, the where clause is just a simple equality test.
My guess is that the cause is a combination of the data step having to read every observation and your network bandwidth or traffic.
Several years ago I started copying network data sets to my local drive because accessing even a data set with 5,000 observations took an excessive amount of time. For example, running proc freq on a couple of variables could take 15 minutes because of the amount of network traffic on that drive, while the same step against the local copy took a few seconds. I can imagine that a data set with a million-plus observations on a drive like that, with the network in that state, could take hours to process.
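If you want to try the same approach, here is a rough sketch of copying the data set to a local library once and then running the step against the local copy. The local path, the output data set name, and the variable in the where statement are placeholders I made up, so substitute your own. I am also assuming the corpxin libref is already assigned in your session.

```sas
/* Hypothetical local folder: point this at a directory that exists on your machine */
libname local "C:\temp\sasdata";

/* Copy the data set across the network once */
proc copy in=corpxin out=local memtype=data;
   select items_202001;
run;

/* Run the subsetting step against the local copy instead of the network drive */
data work.subset;                /* hypothetical output name */
   set local.items_202001;
   where some_var = 'X';         /* placeholder for your equality test */
run;
```

The copy still has to pull every observation over the network, so it only pays off if you read the data more than once. Comparing the real time and CPU time in the log for both versions should show how much of the delay is the network.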