raveena
Obsidian | Level 7
Hi,

Please explain the main difference between DUPKEY and NODUPKEY when removing duplicate records.

Thanks in advance!
2 REPLIES
SPR
Quartz | Level 8
Hello Raveena,

If you are talking about PROC SORT, it has only the NODUPKEY option (to my knowledge), and this is its description:

NODUPKEY
checks for and eliminates observations with duplicate BY values. If you specify this option, then PROC SORT compares all BY values for each observation to the ones for the previous observation that is written to the output data set. If an exact match is found, then the observation is not written to the output data set.
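
As a rough illustration of that description, here is a minimal sketch. The data set names (work.have, work.want, work.dropped) and the variables id and value are made up for the example. DUPOUT= is optional; it is used here only so the discarded observations can be inspected.

```sas
/* Hypothetical sample data: two observations share id=1 but differ in value */
data work.have;
   input id value $;
   datalines;
1 A
1 B
2 C
;
run;

/* NODUPKEY compares only the BY variable (id) and keeps the first
   observation of each BY group; DUPOUT= stores the ones left out */
proc sort data=work.have out=work.want nodupkey dupout=work.dropped;
   by id;
run;
```

With this input, work.want should contain the observations "1 A" and "2 C", and work.dropped should contain "1 B", even though "1 B" is not an exact copy of any other record.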

Sincerely,
SPR
Doc_Duke
Rhodochrosite | Level 12
Raveena,

Be careful with this. NODUPKEY will not remove duplicate records; it removes records with duplicate keys. The NODUP option gets closer, but it still assumes that you have sorted by a sufficient number of keys to uniquely identify the records.
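
A small sketch of that caveat, assuming made-up data set and variable names (work.have, id, value). NODUPRECS is the full name of the NODUP option, and it compares each observation only with the previous one written to the output, so identical records that do not end up next to each other can both survive.

```sas
/* Hypothetical sample data: the first and third observations are identical */
data work.have;
   input id value $;
   datalines;
1 A
1 B
1 A
;
run;

/* Sorted by id alone, the order can remain A, B, A, so the two
   identical "1 A" records are not adjacent and both may be kept */
proc sort data=work.have out=work.partial noduprecs;
   by id;
run;

/* Sorting by every variable places identical records side by side,
   so the second "1 A" is removed */
proc sort data=work.have out=work.unique noduprecs;
   by _all_;
run;
```

Here work.unique should end up with two observations ("1 A" and "1 B"), while work.partial can still hold all three.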

http://support.sas.com/kb/1/566.html

Doc Muhlbaier
Duke
