Hi Folks,
'when we are using options like noduprecs or nodupkey with Proc Sort does the dataset need to be sorted before
For ex:
1 menthod
Proc sort data= stats;
by state region;
run;
proc sort data= stats (noduprecs/nodupkey);
by state region;
run;
or
2 methos
Proc sort data=stats (noduprecs/nodupkey);
by state region;
run;
Is there any problem with 2 method.I think it will remove duplicate observations.
: First, I have NEVER seen a situation where NODUPRECs is of any value. If you don't want duplicates, use NODUPKEY and specify the by variables that you want to ensure aren't duplicated.
Second, no, you don't have to presort your data unless you want to control which of the duplicate records gets deleted. E.g., if the records included a date field and you only wanted to keep the most recent date, you might use two sorts like:
proc sort data= stats;
by state region descending date;
run;
proc sort data= stats nodupkey;
by state region;
run;
Be aware of what noduprecs is doing versus nodupkey, there are separate options.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.