I have searched everywhere, but I don't seem to find the answer. I think my problem is simple, but I can't get it right. My problem: In a dataset, I have patients having multiple diagnoses and duplicate diagnoses. I need to delete these duplicate diagnoses per patient, meaning another patient having the same diagnose should not be deleted. Example of dataset (not real data): Patient_ID Diagnose 1 1 Keep observation 1 1 Delete observation 1 2 Keep observation 1 3 Keep observation 2 1 Keep observation 2 1 Delete observation 2 3 Keep observation 2 3 Delete observation 3 1 Keep observation 3 2 Keep observation 3 5 Keep observation 3 6 Keep observation 4 1 Keep observation 4 1 Delete observation 4 3 Keep observation 4 3 Delete observation Thanks in advance!
... View more