I know the sequence of characters that I want to compress. I am looking at internet search data and not everyone searches for a given company in the same way. So, if I were interested in UUNET users may type in U U NET, UU NET, U UNET, etc. The solution I am looking for does not have to pick up the instances where the characters are transposed (since the vast majority will put the characters in the correct order) just the instances where the charaters are in the 'right' order but the spacing is different.
... View more