<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic SEMMA Cluster Analysis in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/SEMMA-Cluster-Analysis/m-p/528268#M7618</link>
    <description>&lt;P&gt;Hey,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm doing a cluster analysis on a large data set in SAS EM.&lt;/P&gt;&lt;P&gt;I would like to use the SEMMA approach for data mining.&lt;/P&gt;&lt;P&gt;But according to this approach cluster analysis is part of Explore.&lt;/P&gt;&lt;P&gt;But actually this is my model I guess.&lt;/P&gt;&lt;P&gt;My nodes are:&lt;/P&gt;&lt;P&gt;Input Data&lt;/P&gt;&lt;P&gt;Stat Explore&lt;/P&gt;&lt;P&gt;Drop&lt;/P&gt;&lt;P&gt;Filter&lt;/P&gt;&lt;P&gt;Impute&lt;/P&gt;&lt;P&gt;Data Partition&lt;/P&gt;&lt;P&gt;Varclus&lt;/P&gt;&lt;P&gt;Cluster&lt;/P&gt;&lt;P&gt;Score/Assess&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Maybe anyone can tell me the right order?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
    <pubDate>Fri, 18 Jan 2019 08:35:49 GMT</pubDate>
    <dc:creator>SAS_ASS</dc:creator>
    <dc:date>2019-01-18T08:35:49Z</dc:date>
    <item>
      <title>SEMMA Cluster Analysis</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SEMMA-Cluster-Analysis/m-p/528268#M7618</link>
      <description>&lt;P&gt;Hey,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm doing a cluster analysis on a large data set in SAS EM.&lt;/P&gt;&lt;P&gt;I would like to use the SEMMA approach for data mining.&lt;/P&gt;&lt;P&gt;But according to this approach cluster analysis is part of Explore.&lt;/P&gt;&lt;P&gt;But actually this is my model I guess.&lt;/P&gt;&lt;P&gt;My nodes are:&lt;/P&gt;&lt;P&gt;Input Data&lt;/P&gt;&lt;P&gt;Stat Explore&lt;/P&gt;&lt;P&gt;Drop&lt;/P&gt;&lt;P&gt;Filter&lt;/P&gt;&lt;P&gt;Impute&lt;/P&gt;&lt;P&gt;Data Partition&lt;/P&gt;&lt;P&gt;Varclus&lt;/P&gt;&lt;P&gt;Cluster&lt;/P&gt;&lt;P&gt;Score/Assess&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Maybe anyone can tell me the right order?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Fri, 18 Jan 2019 08:35:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SEMMA-Cluster-Analysis/m-p/528268#M7618</guid>
      <dc:creator>SAS_ASS</dc:creator>
      <dc:date>2019-01-18T08:35:49Z</dc:date>
    </item>
    <item>
      <title>Re: SEMMA Cluster Analysis</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SEMMA-Cluster-Analysis/m-p/530643#M7630</link>
      <description>&lt;P&gt;Good morning-&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The "right" order depends on what you are trying to do.&amp;nbsp; The flow that you describe explores the data, drops some variables, filters some observations, imputes missing values, partitions, clusters variables, and then clusters observations based on the results of the variable clustering. If this strategy is your intent, then the order is probably right. It is not clear though how assessment is involved.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;SEMMA is usually involved when you have a target variable.&amp;nbsp; The target predictions can be assessed against the target values that were observed.&amp;nbsp; In cluster analysis, there is no target variable.&amp;nbsp; Instead, unsupervised learning is performed.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a good week.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Jan 2019 14:58:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SEMMA-Cluster-Analysis/m-p/530643#M7630</guid>
      <dc:creator>MikeStockstill</dc:creator>
      <dc:date>2019-01-28T14:58:39Z</dc:date>
    </item>
  </channel>
</rss>

