Solved: How to remove duplicate observations based on the value of the the thi...

zjppdozen · Posted 02-26-2021 02:19 PM

Hi all,

I have a data set like this (see below). There are duplicate observations within some survey waves. Now I want to remove the duplicate observations with the highest "visit age" from the sample. What should I do?

The original data set:

ID	WAVE	Visit age
1231	1	45
1231	2	46
1231	2	47
1232	1	55
1233	1	56
1234	1	34
1234	1	35
1234	2	38

The dataset that I want:

ID	WAVE	Visit age
1231	1	45
1231	2	46
1232	1	55
1233	1	56
1234	1	34
1234	2	38

Thank you very much!

data_null__ · Posted 02-26-2021 02:32 PM

@zjppdozen wrote:

Hi all,

I have a data set like this (see below). There are duplicate observations within some survey waves. Now I want to remove the duplicate observations with the highest "visit age" from the sample. What should I do?

The original data set:

ID WAVE Visit age

1231 1 45

1231 2 46

1231 2 47

1232 1 55

1233 1 56

1234 1 34

1234 1 35

1234 2 38

The dataset that I want:

ID WAVE Visit age

1231 1 45

1231 2 46

1232 1 55

1233 1 56

1234 1 34

1234 2 38

Thank you very much!

proc summary nway missing;
   class id wave;
   output out=deduped(drop=_type_) min('visit age'n)=;
   run;

View solution in original post

data_null__ · Posted 02-26-2021 02:32 PM

@zjppdozen wrote:

Hi all,

I have a data set like this (see below). There are duplicate observations within some survey waves. Now I want to remove the duplicate observations with the highest "visit age" from the sample. What should I do?

The original data set:

ID WAVE Visit age

1231 1 45

1231 2 46

1231 2 47

1232 1 55

1233 1 56

1234 1 34

1234 1 35

1234 2 38

The dataset that I want:

ID WAVE Visit age

1231 1 45

1231 2 46

1232 1 55

1233 1 56

1234 1 34

1234 2 38

Thank you very much!

proc summary nway missing;
   class id wave;
   output out=deduped(drop=_type_) min('visit age'n)=;
   run;

zjppdozen · Posted 04-06-2021 07:14 PM

This code works! Thank you very much!

How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Catch up on SAS Innovate 2026

How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Re: How to remove duplicate observations based on the value of the the third variable?

Catch up on SAS Innovate 2026

SAS Training: Just a Click Away