Many times we've had to sort a table by some variables, then use another proc sort to eliminate duplicate observations based on a subset of the previous variables of the sorting. Eg:
proc sort;
by var1 var2 var3;
run;
proc sort nodupkey;
by var1 var2;
run;
It would be really useful if we could merge those two processes in one, to reduce the reading/writing load. Especially when dealing with larger tables, every rewrite in disk takes a LOT of time.
The example above could be reduced to:
proc sort nodupkey=var2;
by var1 var2 var3;
run;
Regards,
C.
See more ideas labeled with:
5 Comments
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.