Hi Folks,
When we use options like NODUPRECS or NODUPKEY with PROC SORT, does the dataset need to be sorted beforehand?
For example:
Method 1:
Proc sort data= stats;
by state region;
run;
proc sort data=stats nodupkey; /* or noduprecs */
by state region;
run;
or
Method 2:
proc sort data=stats nodupkey; /* or noduprecs */
by state region;
run;
Is there any problem with method 2? I think it will remove duplicate observations.
First, I have NEVER seen a situation where NODUPRECS is of any value. If you don't want duplicates, use NODUPKEY and specify the BY variables that you want to ensure aren't duplicated.
Second, no, you don't have to presort your data unless you want to control which of the duplicate records gets deleted. E.g., if the records included a date field and you only wanted to keep the most recent date, you might use two sorts like:
proc sort data= stats;
by state region descending date;
run;
proc sort data= stats nodupkey;
by state region;
run;
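To see which record survives, here is a minimal sketch of the same two-step sort run on a small made-up dataset. The data values and the output dataset names (stats_sorted, latest) are assumptions for illustration only, and the result relies on PROC SORT's default EQUALS behavior, which keeps observations with equal BY values in their input order.

```sas
/* Hypothetical sample data; only the variable names come from the example above */
data stats;
  input state $ region $ date :date9.;
  format date date9.;
  datalines;
NY East 01JAN2024
NY East 15MAR2024
CA West 10FEB2024
;
run;

/* Step 1: within each state/region, put the most recent date first */
proc sort data=stats out=stats_sorted;
  by state region descending date;
run;

/* Step 2: keep only the first observation of each state/region group */
proc sort data=stats_sorted out=latest nodupkey;
  by state region;
run;

/* With the default EQUALS option, NY East should keep 15MAR2024 */
proc print data=latest;
run;
```

Writing to separate output datasets with OUT= leaves the original stats dataset untouched, which makes it easier to check what was dropped.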
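Be aware of what NODUPRECS does versus NODUPKEY; they are separate options. NODUPKEY removes observations with duplicate BY-variable values, while NODUPRECS removes an observation only when all of its variables match the previous observation written to the output.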
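The difference can be illustrated with a small sketch. The dataset contents, the sales variable, and the output names (by_key, by_rec) are hypothetical and chosen only to make the contrast visible.

```sas
/* Hypothetical sample data: two identical rows plus one row that
   shares the key but has a different sales value */
data stats;
  input state $ region $ sales;
  datalines;
NY East 100
NY East 100
NY East 250
CA West 300
;
run;

/* NODUPKEY: one observation per state/region combination,
   so 2 observations remain (CA West, NY East) */
proc sort data=stats out=by_key nodupkey;
  by state region;
run;

/* NODUPRECS: drops an observation only if every variable matches the
   previous observation written to the output, so 3 observations remain
   (the second NY East 100 row is removed, NY East 250 is kept) */
proc sort data=stats out=by_rec noduprecs;
  by state region;
run;
```

Because NODUPRECS compares only against the previous output observation, fully identical rows that are not adjacent after sorting by the BY variables can both survive.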