BookmarkSubscribeRSS Feed
raveena
Obsidian | Level 7
Hi ,

Please explain whats the main difference between dupkey and nodupkey in order to remove the duplicate records ?

Thanks in Advance !!!
2 REPLIES 2
SPR
Quartz | Level 8 SPR
Quartz | Level 8
Hello Raveena,

If you talking about proc SORT it has only NODUPKEY option (to my knowlege), and this is a description:

NODUPKEY
checks for and eliminates observations with duplicate BY values. If you specify this option, then PROC SORT compares all BY values for each observation to the ones for the previous observation that is written to the output data set. If an exact match is found, then the observation is not written to the output data set.

Sincerely,
SPR
Doc_Duke
Rhodochrosite | Level 12
Raveena,

Be careful with this. NODUPKEY will not remove duplicate records; it removes records with duplicate keys. The NODUP option gets closer, but it still assumes that you have sorted by a sufficient number of keys to uniquely identify the records.

http://support.sas.com/kb/1/566.html

Doc Muhlbaier
Duke

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 2964 views
  • 0 likes
  • 3 in conversation