<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Counting Dictionary words in email address in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265851#M9589</link>
    <description>&lt;P&gt;Some word lists are available at&amp;nbsp;&lt;A href="https://sourceforge.net/projects/wordlist/files/latest/download?source=typ_redirect" target="_self"&gt;https://sourceforge.net/projects/wordlist/files/latest/download?source=typ_redirect&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is this for targeted marketing?&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 23 Apr 2016 19:53:18 GMT</pubDate>
    <dc:creator>PGStats</dc:creator>
    <dc:date>2016-04-23T19:53:18Z</dc:date>
    <item>
      <title>Counting Dictionary words in email address</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265619#M9587</link>
      <description>&lt;P&gt;I have a huge list of email addresses. I want to determine number of dictionary words in each email address. Programming language I am using is SAS.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ex - suppose the email addresses are as below. The output I require is - coolgirl@email.com --&amp;gt; 2 dictionary words - cool and girl angeldream@gmail.como --&amp;gt; 2 dictionary words - angel and dream&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone suggest how to go about it.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Apr 2016 11:16:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265619#M9587</guid>
      <dc:creator>nakulkothari</dc:creator>
      <dc:date>2016-04-22T11:16:06Z</dc:date>
    </item>
    <item>
      <title>Re: Counting Dictionary words in email address</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265627#M9588</link>
      <description>&lt;P&gt;Whilst its easy enouhg to get a list of words off the net, my first search came up with this:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://github.com/dwyl/english-words" target="_blank"&gt;https://github.com/dwyl/english-words&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The question is how are you going to lexicographically parse a text string to find words? &amp;nbsp;There are many combinations, different meanings, different spellings etc. &amp;nbsp;Just take your example: coolgirl, what if it was coolaid? &amp;nbsp;Two separate words, or the company name? &amp;nbsp;What about&amp;nbsp;halfpipe, should it be half and pipe, or halfpipe?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think your best bet would be to investigate text analytics if you really need to do this, although its another license:&lt;/P&gt;
&lt;P&gt;&lt;A href="http://www.sas.com/en_us/software/analytics/text-miner.html" target="_blank"&gt;http://www.sas.com/en_us/software/analytics/text-miner.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Apr 2016 12:06:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265627#M9588</guid>
      <dc:creator>RW9</dc:creator>
      <dc:date>2016-04-22T12:06:03Z</dc:date>
    </item>
    <item>
      <title>Re: Counting Dictionary words in email address</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265851#M9589</link>
      <description>&lt;P&gt;Some word lists are available at&amp;nbsp;&lt;A href="https://sourceforge.net/projects/wordlist/files/latest/download?source=typ_redirect" target="_self"&gt;https://sourceforge.net/projects/wordlist/files/latest/download?source=typ_redirect&lt;/A&gt;.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Is this for targeted marketing?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 23 Apr 2016 19:53:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/265851#M9589</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2016-04-23T19:53:18Z</dc:date>
    </item>
    <item>
      <title>Re: Counting Dictionary words in email address</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/266911#M9590</link>
      <description>No, this is not for targeted marketing.&lt;BR /&gt;&lt;BR /&gt;I am doing a project in which I need to determine number of dictionary words in the email handle.&lt;BR /&gt;&lt;BR /&gt;I am stuck in the question. And don't know how to proceed</description>
      <pubDate>Thu, 28 Apr 2016 07:19:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/266911#M9590</guid>
      <dc:creator>nakulkothari</dc:creator>
      <dc:date>2016-04-28T07:19:38Z</dc:date>
    </item>
    <item>
      <title>Re: Counting Dictionary words in email address</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/266921#M9591</link>
      <description>&lt;P&gt;&lt;SPAN&gt;No, this is not for targeted marketing.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I am doing a project in which I need to determine number of dictionary words in the email handle.&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;I am stuck in the question. And don't know how to proceed&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2016 09:41:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Counting-Dictionary-words-in-email-address/m-p/266921#M9591</guid>
      <dc:creator>nakulkothari</dc:creator>
      <dc:date>2016-04-28T09:41:43Z</dc:date>
    </item>
  </channel>
</rss>

