<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cluster analysis in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16771#M419</link>
    <description>Thank you :-). I have now created dummy variables for the class variables, and I will use them in clustering without standardizing. &lt;BR /&gt;
&lt;BR /&gt;
However, I have some outliers in 2 variables (1 continuous variable and 1 discrete variable). Should I do the outlier treatment first or the standardization first?</description>
    <pubDate>Mon, 18 Oct 2010 09:13:41 GMT</pubDate>
    <dc:creator>samHT</dc:creator>
    <dc:date>2010-10-18T09:13:41Z</dc:date>
    <item>
      <title>Cluster analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16769#M417</link>
      <description>Hi,&lt;BR /&gt;
&lt;BR /&gt;
 I have 2 questions. Can anyone help me with them? &lt;BR /&gt;
&lt;BR /&gt;
    1. Is it necessary to create dummy variables in cluster analysis?&lt;BR /&gt;
    2. If yes, do we need to standardize the dummy variables along with the other continuous variables to get rid of the different measurement units?&lt;BR /&gt;
&lt;BR /&gt;
&lt;BR /&gt;
Thanks</description>
      <pubDate>Thu, 14 Oct 2010 06:25:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16769#M417</guid>
      <dc:creator>samHT</dc:creator>
      <dc:date>2010-10-14T06:25:47Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16770#M418</link>
      <description>The answers are in the PROC CLUSTER reference documentation.&lt;BR /&gt;
&lt;BR /&gt;
1) PROC CLUSTER works only on numeric variables. So, if you have classification variables, you need to recode them as dummy variables or apply some other sort of metric.&lt;BR /&gt;
2) No, you don't need to standardize. See the examples.</description>
      <pubDate>Thu, 14 Oct 2010 17:02:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16770#M418</guid>
      <dc:creator>Doc_Duke</dc:creator>
      <dc:date>2010-10-14T17:02:46Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster analysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16771#M419</link>
      <description>Thank you :-). I have now created dummy variables for the class variables, and I will use them in clustering without standardizing. &lt;BR /&gt;
&lt;BR /&gt;
However, I have some outliers in 2 variables (1 continuous variable and 1 discrete variable). Should I do the outlier treatment first or the standardization first?</description>
      <pubDate>Mon, 18 Oct 2010 09:13:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-analysis/m-p/16771#M419</guid>
      <dc:creator>samHT</dc:creator>
      <dc:date>2010-10-18T09:13:41Z</dc:date>
    </item>
  </channel>
</rss>

