raveena
Obsidian | Level 7
Hi,

Please explain the main difference between DUPKEY and NODUPKEY when removing duplicate records.

Thanks in Advance !!!
SPR
Quartz | Level 8
Hello Raveena,

If you are talking about PROC SORT, it has only the NODUPKEY option (to my knowledge). Here is its description:

NODUPKEY
checks for and eliminates observations with duplicate BY values. If you specify this option, then PROC SORT compares all BY values for each observation to the ones for the previous observation that is written to the output data set. If an exact match is found, then the observation is not written to the output data set.
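
As a rough illustration of that description, here is a minimal sketch. The data set WORK.VISITS, its variables, and the output data set names are made up for the example. DUPOUT= is an additional PROC SORT option that saves the observations NODUPKEY discards, so you can inspect what was dropped.

```sas
/* Hypothetical sample data: ids 1 and 2 each appear more than once. */
data work.visits;
    input id visit $ amount;
    datalines;
1 A 10
1 A 10
1 B 20
2 A 30
2 A 30
3 C 40
;
run;

/* NODUPKEY keeps one observation per distinct value of the BY variable(s). */
/* DUPOUT= writes the discarded observations to a separate data set.        */
proc sort data=work.visits
          out=work.one_per_id
          dupout=work.dropped
          nodupkey;
    by id;
run;

/* Inspect what was kept and what was dropped. */
proc print data=work.one_per_id; run;
proc print data=work.dropped;    run;
```

Note that with BY ID the observation "1 B 20" is dropped as well, even though it is not identical to any other record; only the BY values are compared.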

Sincerely,
SPR
Doc_Duke
Rhodochrosite | Level 12
Raveena,

Be careful with this. NODUPKEY will not remove duplicate records; it removes records with duplicate keys. The NODUP option gets closer, but it still assumes that you have sorted by a sufficient number of keys to uniquely identify the records.
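
A small sketch of the difference, assuming a made-up data set WORK.VISITS (all names here are hypothetical). NODUP is an alias for NODUPRECS, which compares each whole observation only with the previous one written to the output, so identical records that are not adjacent after the sort can survive.

```sas
/* Hypothetical sample data: the record "1 A 10" appears twice, */
/* but the two copies are separated by a different record.      */
data work.visits;
    input id visit $ amount;
    datalines;
1 A 10
1 B 20
1 A 10
2 A 30
;
run;

/* NODUPKEY with BY ID: one observation per id, so "1 B 20" is lost too. */
proc sort data=work.visits out=work.by_key nodupkey;
    by id;
run;

/* NODUPRECS with BY ID: only adjacent identical observations are removed. */
/* The two "1 A 10" records may both remain, because sorting by id alone   */
/* does not bring them next to each other.                                 */
proc sort data=work.visits out=work.by_rec noduprecs;
    by id;
run;

/* Sorting by every variable makes identical records adjacent, */
/* so only fully duplicated records are removed.               */
proc sort data=work.visits out=work.distinct_rows nodupkey;
    by _all_;
run;
```

Sorting BY _ALL_ is one way to make the key "sufficient"; on wide data sets it can be expensive, so listing the variables that truly identify a record is often the better choice.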

http://support.sas.com/kb/1/566.html

Doc Muhlbaier
Duke
