<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Scoring and Classifying New Data in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Scoring-and-Classifying-New-Data/m-p/178996#M9412</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Chris -&lt;/P&gt;&lt;P&gt;This paper on custom entities may be of interest:&amp;nbsp; &lt;A href="http://www.sas.com/en_us/whitepapers/discovering-what-you-want-107347.html"&gt;http://www.sas.com/en_us/whitepapers/discovering-what-you-want-107347.html&lt;/A&gt; &lt;/P&gt;&lt;P&gt;It's a way to include pre-defined entities into a discovery analysis with SAS Text Miner.&lt;/P&gt;&lt;P&gt;You may also be interested in the text profile node in SAS Text Miner, used to associated descriptive terms with different levels of a dependent (target) variable - including time.&lt;/P&gt;&lt;P&gt;Hope this helps,&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 09 Feb 2015 18:36:30 GMT</pubDate>
    <dc:creator>FionaMcNeill</dc:creator>
    <dc:date>2015-02-09T18:36:30Z</dc:date>
    <item>
      <title>Scoring and Classifying New Data</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Scoring-and-Classifying-New-Data/m-p/178995#M9411</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Is there a way to score new data against an existing cluster set and to identify new clusters that are in that new data?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Perhaps an example would be best.&amp;nbsp; Let's say there is an event coming up about the environment.&amp;nbsp; I have a dataset of 10,00 online news articles over the last month that talk about the environment.&amp;nbsp; Some are off-topic, like computer environments or school environments, and others are on-topic, like carbon emissions or sea water rising.&amp;nbsp; So far, so good.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Now the event is held, a big international event.&amp;nbsp; Lots of press coverage.&amp;nbsp; My task ot to see how the on-topic conversations changed in volume and how long that change lasted.&amp;nbsp; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So let's say that after the event is held I download a new set of online news articles, say 20,000 records this time.&amp;nbsp; What I want to do are two things. First, I want to score the new data against the rules that were built in the pre-event processing.&amp;nbsp; Think of it as an apples to apples analysis: using the same rules, the conversations on carbon emissions grew by X percent and lasted Y days; the conversations about sea water rising grew by A percent and lasted B days; the conversations on computer environments and school enviroments did not change.&amp;nbsp; However, and this to me is the tough part, I want to uncover new topics (custers) that may arise.&amp;nbsp; So let's say that after the event a discussion about solar energy emerges that was not in the discussions prior to the event.&amp;nbsp; (I know this sounds weird, but it happens to be true because I've already done the pre-event analysis).&amp;nbsp; How do I identify these new cluster did not exist in the existing clustering routine?&amp;nbsp; &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 12 Jan 2015 18:39:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Scoring-and-Classifying-New-Data/m-p/178995#M9411</guid>
      <dc:creator>ChrisFromMaryland</dc:creator>
      <dc:date>2015-01-12T18:39:11Z</dc:date>
    </item>
    <item>
      <title>Re: Scoring and Classifying New Data</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Scoring-and-Classifying-New-Data/m-p/178996#M9412</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Chris -&lt;/P&gt;&lt;P&gt;This paper on custom entities may be of interest:&amp;nbsp; &lt;A href="http://www.sas.com/en_us/whitepapers/discovering-what-you-want-107347.html"&gt;http://www.sas.com/en_us/whitepapers/discovering-what-you-want-107347.html&lt;/A&gt; &lt;/P&gt;&lt;P&gt;It's a way to include pre-defined entities into a discovery analysis with SAS Text Miner.&lt;/P&gt;&lt;P&gt;You may also be interested in the text profile node in SAS Text Miner, used to associated descriptive terms with different levels of a dependent (target) variable - including time.&lt;/P&gt;&lt;P&gt;Hope this helps,&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 09 Feb 2015 18:36:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Scoring-and-Classifying-New-Data/m-p/178996#M9412</guid>
      <dc:creator>FionaMcNeill</dc:creator>
      <dc:date>2015-02-09T18:36:30Z</dc:date>
    </item>
  </channel>
</rss>

