I have a hypothetical data as below.
If data in the 1st, 2nd, 3rd, 5th and 6th column are same and 4th column is either "A" or "B" then they will be considered as duplicate and one of those rows will be kept.
also
If data in the 1st, 2nd, 3rd, 5th and 6th column are same and 4th column is either "D" or "E" then they will be considered as duplicate and one of those rows will be kept.
So, the following rows will be considered as duplicates and one of them will be kept.
Rows 1 and 2
Rows 4 and 5
Rows 6 and 7
Rows 9 and 10
data have; input ID date Test $ SubTest $ SourceCode Result $; datalines; 1 1/1/21 ABC A 1 Positive 1 1/1/21 ABC B 1 Positive 1 1/1/21 ABC C 1 Negative 1 1/10/21 DEF D 1 Positive 1 1/10/21 DEF E 1 Positive 1 1/10/21 DEF F 1 Negative
2 1/1/21 ABC A 1 Negative 2 1/1/21 ABC B 1 Negative 2 1/1/21 ABC C 1 Negative 2 1/10/21 DEF D 1 Positive 2 1/10/21 DEF E 1 Positive 2 1/10/21 DEF F 1 Negative
3 1/1/21 ABC A 1 Positive 3 1/1/21 ABC B 1 Negative 3 1/1/21 ABC C 1 Negative 3 1/10/21 DEF D 1 Negative 3 1/10/21 DEF E 1 Positive 3 1/10/21 DEF F 1 Negative
4 1/1/21 ABC A 1 Positive 4 1/1/21 ABC B 2 Positive 4 1/1/21 ABC C 1 Negative 4 1/10/21 DEF D 1 Negative 4 1/10/21 DEF E 1 Positive 4 1/10/21 DEF F 1 Negative ; run;
... View more