<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Mine or Scrub Websites w/o SAS Text Miner in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409601#M100069</link>
    <description>&lt;P&gt;At work, was asked a question about SAS and not sure of the answer. Basically, we want to SAS to search the internet (maybe just clinicaltrials.gov) or RSS feeds (RSS is probably easier) to see when companies post about their new studies using various search terms (like, hypertension). SAS can run this overnight and would email us the results every morning. I can do the email and overnight code, but I have no clue how to do this potentially text mining procedure with only using SAS 9. Sounds pretty easy, but not sure where to start. Any ideas?&lt;/P&gt;</description>
    <pubDate>Wed, 01 Nov 2017 19:31:26 GMT</pubDate>
    <dc:creator>dereck255</dc:creator>
    <dc:date>2017-11-01T19:31:26Z</dc:date>
    <item>
      <title>Mine or Scrub Websites w/o SAS Text Miner</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409601#M100069</link>
      <description>&lt;P&gt;At work, was asked a question about SAS and not sure of the answer. Basically, we want to SAS to search the internet (maybe just clinicaltrials.gov) or RSS feeds (RSS is probably easier) to see when companies post about their new studies using various search terms (like, hypertension). SAS can run this overnight and would email us the results every morning. I can do the email and overnight code, but I have no clue how to do this potentially text mining procedure with only using SAS 9. Sounds pretty easy, but not sure where to start. Any ideas?&lt;/P&gt;</description>
      <pubDate>Wed, 01 Nov 2017 19:31:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409601#M100069</guid>
      <dc:creator>dereck255</dc:creator>
      <dc:date>2017-11-01T19:31:26Z</dc:date>
    </item>
    <item>
      <title>Re: Mine or Scrub Websites w/o SAS Text Miner</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409662#M100091</link>
      <description>&lt;P&gt;You'll need to use the usual text manipulation functions: find(), scan(), spelling distances,&amp;nbsp;and so on, to analyse the pages.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You'll also need to&amp;nbsp;access and read the web sites. Look at : filename method URL&amp;nbsp;&amp;amp; proc http.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Nov 2017 22:08:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409662#M100091</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2017-11-01T22:08:39Z</dc:date>
    </item>
    <item>
      <title>Re: Mine or Scrub Websites w/o SAS Text Miner</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409680#M100100</link>
      <description>&lt;P&gt;Consider gathering the data outside of SAS and then use SAS for analysis. Languages such as C# and Python are much better at web technologies such as REST, RSS, etc. Dump to tab-delimited files or an RDBMS then have SAS chug away at it.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is a good case, IMO, or combo technologies vs relying on one or another. There are also existing tools that can do the monitoring vs rolling your own. Command l;ine them then have SAS execute them for you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Nov 2017 23:15:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Mine-or-Scrub-Websites-w-o-SAS-Text-Miner/m-p/409680#M100100</guid>
      <dc:creator>AlanC</dc:creator>
      <dc:date>2017-11-01T23:15:30Z</dc:date>
    </item>
  </channel>
</rss>

