Sort needs approximately 2.5 disk space relating to original dataset disk space.
Is the data already sorted by any key plus or including month ?
If positive you can do:
proc sort data=have(where=(month=<desired>))
out=month_sorted nodupkey;
by <key variables>;
run;
data new;
set have(where=(moth < <desired>))
month_sorted
have(where=(month > <desired>))
;
run;
Sort needs approximately 2.5 disk space relating to original dataset disk space.
Is the data already sorted by any key plus or including month ?
If positive you can do:
proc sort data=have(where=(month=<desired>))
out=month_sorted nodupkey;
by <key variables>;
run;
data new;
set have(where=(moth < <desired>))
month_sorted
have(where=(month > <desired>))
;
run;
try TAGSORT option of proc sort.
proc sort data=have out=month_sorted nodupkey tagsort sortsize=max;
run;
Hi Ksharp,
As Shmuel quoted "Sort needs approximately 2.5 disk space relating to original dataset disk space.". So in my case I tried with shmuel suggesition and CPU, I/O statistics looks good.
I would like to highlight one more point on the data set which I worked is, its a size of approxmately 1 TB since it was compressed its of 200 GB. So I obersved that working on segments of huge data set is looks fine.
Thanks for your suggesition!
Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.
Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.