<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to filter phrases and regular expressions? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244103#M3589</link>
    <description>&lt;P&gt;Half the people will recommend doing this transformations before importing data into EM, half the people will recommend doing it in EM.&lt;/P&gt;
&lt;P&gt;If I was to add it on EM, I would do it on a transform node (use the SAS code ellipsis!), hptransform node, or in a SAS code node.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;good luck!&lt;/P&gt;</description>
    <pubDate>Sun, 17 Jan 2016 22:41:03 GMT</pubDate>
    <dc:creator>M_Maldonado</dc:creator>
    <dc:date>2016-01-17T22:41:03Z</dc:date>
    <item>
      <title>How to filter phrases and regular expressions?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244080#M3586</link>
      <description>&lt;P&gt;In SAS Enterprise Miner Workstation 13.2, I'm using some Text Mining nodes to build Text Topics.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I noticed lots of phrases and tokens tha I would like filtered out of the data before analysis. Examples include html tags such as "&amp;lt;p&amp;gt;", and boilerplate text such as "This description was written by the Martin Group." I tried adding these things to the list of stop words, but that didn't seem to help: the terms still appeared in the created topics.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there a way to filter out multi-word phrases? And is there a way to filter out regular expressions, such as "This description was written by .*"?&lt;/P&gt;</description>
      <pubDate>Sun, 17 Jan 2016 20:26:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244080#M3586</guid>
      <dc:creator>stepthom</dc:creator>
      <dc:date>2016-01-17T20:26:09Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter phrases and regular expressions?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244084#M3587</link>
      <description>&lt;P&gt;Regular expression matching is very flexible. There is almost certainly a way to do what you describe. But we need something more concrete to suggest good examples. Please give us a list of phrases that you would want to check and what you would expect as a result.&lt;/P&gt;</description>
      <pubDate>Sun, 17 Jan 2016 21:00:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244084#M3587</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2016-01-17T21:00:41Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter phrases and regular expressions?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244099#M3588</link>
      <description>&lt;P&gt;Hi PG Stats,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I think I can handle the construction of the regular expression, that's not a problem. My question was trying to ask, where do I put them? (Which node, which field?) I couldn't find it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thanks!&lt;/P&gt;</description>
      <pubDate>Sun, 17 Jan 2016 22:15:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244099#M3588</guid>
      <dc:creator>stepthom</dc:creator>
      <dc:date>2016-01-17T22:15:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter phrases and regular expressions?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244103#M3589</link>
      <description>&lt;P&gt;Half the people will recommend doing this transformations before importing data into EM, half the people will recommend doing it in EM.&lt;/P&gt;
&lt;P&gt;If I was to add it on EM, I would do it on a transform node (use the SAS code ellipsis!), hptransform node, or in a SAS code node.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;good luck!&lt;/P&gt;</description>
      <pubDate>Sun, 17 Jan 2016 22:41:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/How-to-filter-phrases-and-regular-expressions/m-p/244103#M3589</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2016-01-17T22:41:03Z</dc:date>
    </item>
  </channel>
</rss>

