BookmarkSubscribeRSS Feed
JasonNC
Quartz | Level 8

Hi Folks,

'when we are using options like noduprecs or nodupkey with Proc Sort does the dataset need to be sorted before

For ex:

1 menthod

Proc sort data= stats;

by state region;

run;

proc sort data= stats (noduprecs/nodupkey);

by state region;

run;

or

2 methos

Proc sort data=stats (noduprecs/nodupkey);

by state region;

run;

Is there any problem with 2 method.I think it will remove duplicate observations.

2 REPLIES 2
art297
Opal | Level 21

: First, I have NEVER seen a situation where NODUPRECs is of any value.  If you don't want duplicates, use NODUPKEY and specify the by variables that you want to ensure aren't duplicated.

Second, no, you don't have to presort your data unless you want to control which of the duplicate records gets deleted.  E.g., if the records included a date field and you only wanted to keep the most recent date, you might use two sorts like:

proc sort data= stats;

  by state region descending date;

run;

proc sort data= stats nodupkey;

  by state region;

run;

Reeza
Super User

Be aware of what noduprecs is doing versus nodupkey, there are separate options.

http://www2.sas.com/proceedings/sugi30/037-30.pdf

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1070 views
  • 0 likes
  • 3 in conversation