01-31-2017 12:30 PM
I have a dataset with thousdsand of visits.
We have first name, last name, address and medical record number. When people enter the data there will be errors, and we want to make sure we capture all the visits for the right person (based on first name, last name address and medical record number).
The data entry people to mix up the number in the MRN, switch letters in the name and cause us to count to many...
I need to clean the data and remove records that are like other records.
Does this help?
02-01-2017 02:19 AM
It would help a lot if you could post some sample data.
When you say you want to remove records that are like other records, does that mean that the records have to match for every variable in your dataset to be removed or only by some of the variables?