HI, I have a dataset that looks like the following. There are duplicate record IDs due to minor typos in the firstname and lastname variables. Record FirstName LastName Gender 009 Jason Aman M 009 Jazon Aman M 100 Marshall Reid F 101 Reed Jones M 102 Aundo Kalen F 102 Aundy Kalen F 110 Jack Sarz M I would like to fix these duplicates, so there's only one line per record number. I've started fixing them manually, but there are at least 700 observations. This is what I want the dataset to look like: Record FirstName LastName Gender 009 Jason Aman M 100 Marshall Reid F 101 Reed Jones M 102 Aundy Kalen F 110 Jack Sarz M Before continuing with the manual approach, I wanted to check and see if anyone had ideas of a are more programmatic way to fix these duplicate names?
... View more