Dear All,
I am working on a dataset with 1 million observations.
During the process of booking, the booking team manually enteres the Company name in the system which further integrates to our ERP.
Now a name could be written like "ABC Ltd", "*ABC Ltd", "abcltd" "abc ltd" and in many other formats.
my objetive is to create one single entry lets say "ABC Ltd" which should have all the relavant count of entires.
A case statement would have been perect here however there are like 1000 duplicates hence for these i cannot implement a case statement.
If you could please suggest an alternate.
... View more