BookmarkSubscribeRSS Feed
Walternate
Obsidian | Level 7

Hi,

 

I have a dataset at the person level but with duplicate rows. It has ID and character variables A, B, and C. I wanted unique rows, so I ran this code:

 

proc sort nodupkey data=have;

by ID char_A char_B char_C;

run;

 

It worked without producing an error message, but when looking through the data I noticed that at least one duplicate row remained.

 

ID   Char_A        Char_B      Char_C

1         abc- d       def_g         ghi

1         abc- d       def_g         ghi

 

I'm not sure why this row remained in the data, as it looks like most of the duplicate rows were correctly deleted. Is there a way to troubleshoot and figure out whether there's some minor difference between the character variables or some other reason that the duplicate row wasn't removed?

 

Thanks!

 

 

 

1 REPLY 1
data_null__
Jade | Level 19

Display the values of the BY variables for the suspect observations using $HEX format, I expect you will find they are different.  There is probably a character that is displayed as a space but is not, or you have a different number of leading spaces.

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 1 reply
  • 737 views
  • 1 like
  • 2 in conversation