I think this is a very challenging problem I am facing and I have no idea how to deal with it
Suppose I have two csv files
Toyota Inc. Camry, 2000km
Honda Corp Civic,1500km
HondaUSA Inf, 2000, 2300km
I want to generate C.csv
Toyota Camry,1998,blue ,2000km
The worst part of the task is that there needs to be error tolerance to deal with the variations in the company name
3.phrases such as Inc, corp.
4.Create a list of manual translation tables(Acura translates to HondaUSA)
You could create a list of words you are looking for e.g. Honda Acura Nissan
and use the indexw function to look for them in the variable. If you find the word then put that word into a variable. Do this on both datasets, then change any Accura to to Honda etc.
Remember the INDEXW function is looking for words, so it uses a delimiter. So it won't pick the Honda out of HondaUSA. It is looking for words, not character strings. You can use regular expressions for that if you are really keen.
When you have a standard variable in each data set you can merge them.