Hi Folks,
When we use options like NODUPRECS or NODUPKEY with PROC SORT, does the dataset need to be sorted beforehand?
For example:
Method 1:
Proc sort data= stats;
by state region;
run;
proc sort data=stats nodupkey; /* or noduprecs */
by state region;
run;
or
Method 2:
proc sort data=stats nodupkey; /* or noduprecs */
by state region;
run;
Is there any problem with Method 2? I think it will remove the duplicate observations.
First, I have NEVER seen a situation where NODUPRECS is of any value. If you don't want duplicates, use NODUPKEY and specify the BY variables that you want to ensure aren't duplicated.
Second, no, you don't have to presort your data unless you want to control which of the duplicate records gets deleted. E.g., if the records included a date field and you only wanted to keep the most recent date, you might use two sorts like:
proc sort data= stats;
by state region descending date;
run;
proc sort data= stats nodupkey;
by state region;
run;
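To make the two-sort approach concrete, here is a minimal sketch with made-up data. The sample values, the DATE variable, and the output data set name STATS_LATEST are assumptions for illustration, not part of the original question. It relies on PROC SORT keeping the first observation it encounters in each BY group, which holds when the EQUALS behavior (the documented default) is in effect.

```sas
/* Hypothetical sample data: values and the DATE variable are illustrative only */
data stats;
    input state $ region $ date :date9.;
    format date date9.;
    datalines;
NC East 01JAN2023
NC East 15MAR2023
NC West 10FEB2023
VA East 05JAN2023
VA East 20JAN2023
;
run;

/* Step 1: within each state/region, put the most recent date first */
proc sort data=stats;
    by state region descending date;
run;

/* Step 2: keep the first observation per state/region.
   OUT= writes the result to a new data set so STATS is not overwritten. */
proc sort data=stats out=stats_latest nodupkey;
    by state region;
run;

proc print data=stats_latest;
run;
```

With this data, STATS_LATEST should hold one row per state/region combination, each carrying the latest date in its group. Writing to OUT= rather than sorting in place is a design choice here: NODUPKEY discards rows permanently, so keeping the original data set makes it easy to check what was dropped.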
Be aware of what NODUPRECS does versus NODUPKEY; they are separate options.
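The difference can be seen by running both options on the same input. This is a small sketch with invented data; the SALES variable and the output names BY_KEY and BY_REC are assumptions for illustration. NODUPKEY compares only the BY variables, while NODUPRECS compares every variable in an observation against the previous observation written to the output.

```sas
/* Hypothetical data: SALES is an illustrative extra variable */
data stats;
    input state $ region $ sales;
    datalines;
NC East 100
NC East 100
NC East 250
VA West 300
;
run;

/* NODUPKEY: one observation per state/region; other variables are ignored */
proc sort data=stats out=by_key nodupkey;
    by state region;
run;

/* NODUPRECS: drops an observation only when all of its variables
   match the previous observation written to the output */
proc sort data=stats out=by_rec noduprecs;
    by state region;
run;
```

Here BY_KEY ends up with two observations (one per state/region), while BY_REC keeps three, because the NC East row with SALES=250 differs from the row before it. Since NODUPRECS only compares adjacent observations, identical records that are not next to each other after sorting can survive, which is one reason it is often considered unreliable unless the BY statement lists enough variables to bring identical records together.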