I agree. I don't think the data will ever get 100 percent clean with SCAN, TRANWRD, STRIP, SUBSTR, etc. alone. I have to match a target file to a reference file. My plan is to auto-clean the relatively easy stuff (upper/lower case, PO boxes, rural routes), do all the exact matches first, then fuzzy match (using COMPGED), and then hand clean whatever is left. If I can get the match rate into the high 90s percent with fewer than 100 hand cleanings required, I will call it a success. At this point I'm starting with no more than 150,000 records in the target file.
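
To make the staged plan concrete, here is a rough sketch of the flow (normalize, exact match, fuzzy match, leftover pile for hand cleaning). It is written in Python rather than SAS just so it is easy to run anywhere. Assumptions to flag: `difflib.SequenceMatcher` is only a stand-in for COMPGED (it returns a similarity ratio where higher is better, not a generalized edit distance cost), and the `normalize` rules, the 0.85 cutoff, the function names, and the sample addresses are all made up for illustration.

```python
import re
from difflib import SequenceMatcher


def normalize(addr: str) -> str:
    """Cheap automatic cleaning: case, punctuation, PO boxes, rural routes."""
    s = addr.upper()
    s = re.sub(r"[.,#]", " ", s)  # drop common punctuation
    s = re.sub(r"\bP\s*O\s*BOX\b|\bPOST OFFICE BOX\b", "PO BOX", s)
    s = re.sub(r"\b(RURAL ROUTE|RURAL RTE|R\s*R)\b", "RR", s)
    return " ".join(s.split())  # collapse repeated blanks


def staged_match(target, reference, cutoff=0.85):
    """Return (exact, fuzzy, manual): two dicts of matches and a leftover list."""
    ref_by_key = {normalize(r): r for r in reference}
    exact, fuzzy, manual = {}, {}, []
    for t in target:
        key = normalize(t)
        # Stage 1: exact match on the normalized string.
        if key in ref_by_key:
            exact[t] = ref_by_key[key]
            continue
        # Stage 2: fuzzy match, keeping the best-scoring reference key.
        best_key, best_score = None, 0.0
        for ref_key in ref_by_key:
            score = SequenceMatcher(None, key, ref_key).ratio()
            if score > best_score:
                best_key, best_score = ref_key, score
        if best_key is not None and best_score >= cutoff:
            fuzzy[t] = ref_by_key[best_key]
        else:
            # Stage 3: nothing close enough, so it goes to hand cleaning.
            manual.append(t)
    return exact, fuzzy, manual


# Hypothetical sample data, not from any real file.
reference = ["PO BOX 12", "RR 3 BOX 45", "100 MAIN ST"]
target = ["P.O. Box 12", "Rural Route 3 Box 45", "100 Main Stret", "77 Elm Ave"]

exact, fuzzy, manual = staged_match(target, reference)
match_rate = (len(exact) + len(fuzzy)) / len(target)
```

The length of `manual` is the hand-cleaning count and `match_rate` is the figure to compare against the high-90s goal. The brute-force inner loop compares each unmatched record against every reference key, so it is only workable on the leftovers after the exact pass, and the cutoff would need tuning against real data.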