Hi All, @hashman

Frequently I find that limited memory prevents a hash object from running, even though the same data can still be processed with proc sort.

I have a very large dataset, and I'm trying to reduce its size by the date on which the data was imported. The date variable is stored as a string, for example:

2018-04-03 06:40:28.738

I need to subset the dataset by date. If the dataset were smaller, I would only need a where statement such as:

where trim(substr(date,7,4)) = "4-03";

But this takes a long time and sometimes crashes. Applying a datetime format to the original date variable was even slower, or nearly impossible. Creating a new variable that holds only the date on the raw dataset was also close to impossible. (If I could do that easily, I would not need to subset by date.)

I can create a hash table and a hash dataset that contains only the date, but the original date variable holds date + time, so it will not match.

The following is a sample dataset with only the date variable:

data have;
input date $30.;
cards;
2018-04-03 03:44:18.728
2018-04-03 07:40:02.221
2018-04-03 09:20:20.135
2018-04-03 14:50:11.752
2018-04-03 02:42:17.005
2018-04-05 01:22:20.264
2018-04-05 04:45:49.402
2018-04-06 04:09:50.710
2018-04-07 04:12:31.623
2018-04-11 04:11:01.528
;

I have already tried reducing the number of variables, but it did not help much because the dataset also has a very large number of rows. I'd appreciate any help!
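
To make the matching problem concrete, here is a minimal sketch of the kind of lookup described above, run against the sample data only. It assumes the day is always the first 10 characters of date. The lookup dataset `dates` and the key variable `day` are hypothetical names introduced just for this illustration, and nothing here has been tried on the full-size data.

```sas
/* Hypothetical lookup dataset: one row per day to keep */
data dates;
    length day $10;
    input day $10.;
    cards;
2018-04-03
2018-04-05
;

data want;
    length day $10;                       /* key variable must exist before the hash is defined */
    if _n_ = 1 then do;
        declare hash h(dataset: 'dates'); /* load the wanted days into memory once */
        h.defineKey('day');
        h.defineDone();
    end;
    set have;
    day = substr(date, 1, 10);            /* assumption: date always starts with yyyy-mm-dd */
    if h.check() = 0;                     /* keep the row only if its day is in the hash */
    drop day;
run;
```

The hash here holds only the short list of days rather than the large table, so its memory use stays small. The large dataset is still read once, row by row.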