Is there a way to group together the misspellings of words prior to creating a summary table so when I am searching the data set it uses the one word for all the misspellings (I have all the misspellings listed).
Example:
Replacing like for love, likes, liked, liker
Hi elainathewonder,
Greetings of the day.
I have done something for you, just have a check and let me know if you mean this.
data test;
patternID=prxparse("/L\w+E/o");
input address $80. ;
position = prxmatch(patternID, address);
if position ^= 0 then address= tranwrd(address,substr(address,POSITION,5),'Love');
datalines;
Zack Johnson, 153 LirsE Str, Chapel Hill, NC27514
Dan Zack, 67891 64th st, Brea, CA
Sally Johns, 4 Moritz LtreE, Duarte, CA 91010
;
run;
In the above example LirsE, LtreE few words are there which got replaced with a common word 'Love'. So if think like there is some kind of similarity in the misspelling texts you can identify that and parse the same as per example and you are done.
Please check and let me know if there is any disconnect.
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Still thinking about your presentation idea? The submission deadline has been extended to Friday, Nov. 14, at 11:59 p.m. ET.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.