<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Correcting Misspelled Words In A Data Set in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487787#M287325</link>
    <description>&lt;P&gt;Well, you could use soundex:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/lrdict/64316/HTML/default/viewer.htm#a000245948.htm" target="_blank"&gt;http://support.sas.com/documentation/cdl/en/lrdict/64316/HTML/default/viewer.htm#a000245948.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://blogs.sas.com/content/sgf/2015/01/27/how-to-perform-a-fuzzy-match-using-sas-functions/" target="_blank"&gt;https://blogs.sas.com/content/sgf/2015/01/27/how-to-perform-a-fuzzy-match-using-sas-functions/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Problem is what happens if one of your mispelt ones is actually another name, or exactly between the two.&amp;nbsp; Its really a data capture issue, if a pre-defined list of companies was to be used then it should have been a select from dropdown rather than free text, free text is always an issue.&lt;/P&gt;</description>
    <pubDate>Fri, 17 Aug 2018 14:36:04 GMT</pubDate>
    <dc:creator>RW9</dc:creator>
    <dc:date>2018-08-17T14:36:04Z</dc:date>
    <item>
      <title>Correcting Misspelled Words In A Data Set</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487772#M287324</link>
      <description>&lt;P&gt;I have a lookup table of manufacturer name.&amp;nbsp; I have a second table of misspelled manufacturer names.&amp;nbsp; How can I get the misspelled words corrected in a new, third table showing the before and after using SAS code?&amp;nbsp; My Data set has thousands of distinct misspelled words.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Lookup Table:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Samsung&lt;/P&gt;&lt;P&gt;Sony&lt;/P&gt;&lt;P&gt;Nokia&lt;/P&gt;&lt;P&gt;Dell&lt;/P&gt;&lt;P&gt;Apple&lt;/P&gt;&lt;P&gt;HP&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Misspelled Names Table:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Samesung&lt;/P&gt;&lt;P&gt;Soni&lt;/P&gt;&lt;P&gt;Nokya&lt;/P&gt;&lt;P&gt;Doll&lt;/P&gt;&lt;P&gt;Apele&lt;/P&gt;&lt;P&gt;HC&lt;/P&gt;</description>
      <pubDate>Fri, 17 Aug 2018 14:13:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487772#M287324</guid>
      <dc:creator>Fistful_Dollars</dc:creator>
      <dc:date>2018-08-17T14:13:01Z</dc:date>
    </item>
    <item>
      <title>Re: Correcting Misspelled Words In A Data Set</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487787#M287325</link>
      <description>&lt;P&gt;Well, you could use soundex:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/lrdict/64316/HTML/default/viewer.htm#a000245948.htm" target="_blank"&gt;http://support.sas.com/documentation/cdl/en/lrdict/64316/HTML/default/viewer.htm#a000245948.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://blogs.sas.com/content/sgf/2015/01/27/how-to-perform-a-fuzzy-match-using-sas-functions/" target="_blank"&gt;https://blogs.sas.com/content/sgf/2015/01/27/how-to-perform-a-fuzzy-match-using-sas-functions/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Problem is what happens if one of your mispelt ones is actually another name, or exactly between the two.&amp;nbsp; Its really a data capture issue, if a pre-defined list of companies was to be used then it should have been a select from dropdown rather than free text, free text is always an issue.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Aug 2018 14:36:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487787#M287325</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2018-08-17T14:36:04Z</dc:date>
    </item>
    <item>
      <title>Re: Correcting Misspelled Words In A Data Set</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487827#M287326</link>
      <description>&lt;P&gt;Thanks, RW9.&amp;nbsp; The code from that blog page helped.&amp;nbsp; Not super accurate, but like you say that's the issue with free-form text and no drop-down.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Aug 2018 15:56:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487827#M287326</guid>
      <dc:creator>Fistful_Dollars</dc:creator>
      <dc:date>2018-08-17T15:56:28Z</dc:date>
    </item>
    <item>
      <title>Re: Correcting Misspelled Words In A Data Set</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487843#M287327</link>
      <description>&lt;P&gt;If it makes you feel any better I've had to clean up such. One project where had an expected list of company names to be common answers we provided the data collection people with what should be entered. I had 18 spellings for what should have been IBM. The most entertaining of which was I&amp;gt;B&amp;gt;M&amp;gt;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Aug 2018 17:02:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487843#M287327</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2018-08-17T17:02:17Z</dc:date>
    </item>
    <item>
      <title>Re: Correcting Misspelled Words In A Data Set</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487913#M287328</link>
      <description>&lt;P&gt;You can check if your company has DataFlux. It's designed for exactly this type of matching.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/149362"&gt;@Fistful_Dollars&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I have a lookup table of manufacturer name.&amp;nbsp; I have a second table of misspelled manufacturer names.&amp;nbsp; How can I get the misspelled words corrected in a new, third table showing the before and after using SAS code?&amp;nbsp; My Data set has thousands of distinct misspelled words.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Lookup Table:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Samsung&lt;/P&gt;
&lt;P&gt;Sony&lt;/P&gt;
&lt;P&gt;Nokia&lt;/P&gt;
&lt;P&gt;Dell&lt;/P&gt;
&lt;P&gt;Apple&lt;/P&gt;
&lt;P&gt;HP&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Misspelled Names Table:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Samesung&lt;/P&gt;
&lt;P&gt;Soni&lt;/P&gt;
&lt;P&gt;Nokya&lt;/P&gt;
&lt;P&gt;Doll&lt;/P&gt;
&lt;P&gt;Apele&lt;/P&gt;
&lt;P&gt;HC&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Aug 2018 19:33:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Correcting-Misspelled-Words-In-A-Data-Set/m-p/487913#M287328</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-08-17T19:33:46Z</dc:date>
    </item>
  </channel>
</rss>

