<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Cluster Analysis with SAS, When the data are mixed in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47343#M2063</link>
    <description>Hi..&lt;BR /&gt;
&lt;BR /&gt;
I have never done cluster analysis with SAS before. I have read the websites, but the details are lengthy, so I am still confused about the general steps in performing cluster analysis with SAS. In some software, I could just load the raw data and get the results. Can anyone give me a rough idea of how to do it, so that I know which topics I should focus on?&lt;BR /&gt;
&lt;BR /&gt;
I have a data set of about 200,000 observations with about 30-35 attributes. All of it is raw transaction data. Some attributes are categorical (with many possible categories), some are numeric, and some are binary (0 and 1). I am looking to find anomalous or suspicious transactions (outliers). Can anyone tell me the general steps I should follow in performing cluster analysis?&lt;BR /&gt;
&lt;BR /&gt;
Thank you so much in advance.&lt;BR /&gt;
&lt;BR /&gt;
Best,&lt;BR /&gt;
Panda</description>
    <pubDate>Mon, 22 Jun 2009 03:08:54 GMT</pubDate>
    <dc:creator>deleted_user</dc:creator>
    <dc:date>2009-06-22T03:08:54Z</dc:date>
    <item>
      <title>Cluster Analysis with SAS, When the data are mixed</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47343#M2063</link>
      <description>Hi..&lt;BR /&gt;
&lt;BR /&gt;
I have never done cluster analysis with SAS before. I have read the websites, but the details are lengthy, so I am still confused about the general steps in performing cluster analysis with SAS. In some software, I could just load the raw data and get the results. Can anyone give me a rough idea of how to do it, so that I know which topics I should focus on?&lt;BR /&gt;
&lt;BR /&gt;
I have a data set of about 200,000 observations with about 30-35 attributes. All of it is raw transaction data. Some attributes are categorical (with many possible categories), some are numeric, and some are binary (0 and 1). I am looking to find anomalous or suspicious transactions (outliers). Can anyone tell me the general steps I should follow in performing cluster analysis?&lt;BR /&gt;
&lt;BR /&gt;
Thank you so much in advance.&lt;BR /&gt;
&lt;BR /&gt;
Best,&lt;BR /&gt;
Panda</description>
      <pubDate>Mon, 22 Jun 2009 03:08:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47343#M2063</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2009-06-22T03:08:54Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster Analysis with SAS, When the data are mixed</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47344#M2064</link>
      <description>You might start by reading the Introduction to Clustering Procedures:&lt;BR /&gt;
&lt;BR /&gt;
&lt;A href="http://support.sas.com/onlinedoc/913/getDoc/en/statug.hlp/introclus_index.htm" target="_blank"&gt;http://support.sas.com/onlinedoc/913/getDoc/en/statug.hlp/introclus_index.htm&lt;/A&gt;</description>
      <pubDate>Mon, 22 Jun 2009 18:30:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47344#M2064</guid>
      <dc:creator>sfleming</dc:creator>
      <dc:date>2009-06-22T18:30:21Z</dc:date>
    </item>
    <item>
      <title>Re: Cluster Analysis with SAS, When the data are mixed</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47345#M2065</link>
      <description>If you have Enterprise Guide 4.1, then it's really easy to get started with clustering. Go to Analyze --&amp;gt; Multivariate --&amp;gt; Cluster analysis.&lt;BR /&gt;
&lt;BR /&gt;
I have just started playing around with the cluster procedure. Here are some things to keep in mind:&lt;BR /&gt;
 - I am not sure if the procedure handles character variables (you might want to convert the categorical values into numeric values)&lt;BR /&gt;
 - you might have to standardize the data (for example, if you have the raw number of transactions every day, try converting them to percentages of some sort)&lt;BR /&gt;
&lt;BR /&gt;
Hope this helps!&lt;BR /&gt;
kdp</description>
      <pubDate>Mon, 29 Jun 2009 19:44:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47345#M2065</guid>
      <dc:creator>kdp</dc:creator>
      <dc:date>2009-06-29T19:44:13Z</dc:date>
    </item>
    <item>
      <title>GAP analysis?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47346#M2066</link>
      <description>We are using SAS for cluster analysis. Does anyone have a protocol for GAP analysis to determine the optimal number of clusters? khamil</description>
      <pubDate>Thu, 04 Mar 2010 16:33:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Cluster-Analysis-with-SAS-When-the-data-are-mixed/m-p/47346#M2066</guid>
      <dc:creator>khamil</dc:creator>
      <dc:date>2010-03-04T16:33:27Z</dc:date>
    </item>
  </channel>
</rss>

