<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MIxed type of variables in cluster analaysis in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428867#M22524</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I&amp;nbsp;am doing a cluster analysis with 10 continuous variables and 3 categorical variables.&amp;nbsp;Instead of converting&amp;nbsp;categorical variables into dummies, I am thinking of creating distance matrix&amp;nbsp;using "PROC DISTANCE".&lt;/P&gt;&lt;P&gt;1) Calculate 3 sets of distance&amp;nbsp;matrix and each set contains the distance between one&amp;nbsp;categorical variable(id category_var1)&amp;nbsp;and 10 continuous variables(var interval(continuous&amp;nbsp;_var1-continuous10)&amp;nbsp;&lt;/P&gt;&lt;P&gt;2) then merge 3 sets of distance matrix back with the values of 10 continuous&amp;nbsp;variables&lt;/P&gt;&lt;P&gt;3) Standardize them and use&amp;nbsp;standardized&amp;nbsp;variables&amp;nbsp;as the new variables in "PROC CLUSTER" or "PROC FASTCLUS"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Question, Dose the logic make sense to you, particularly step 1&amp;nbsp;? Thank you.&lt;/P&gt;</description>
    <pubDate>Thu, 18 Jan 2018 16:42:34 GMT</pubDate>
    <dc:creator>lionking19063</dc:creator>
    <dc:date>2018-01-18T16:42:34Z</dc:date>
    <item>
      <title>MIxed type of variables in cluster analaysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428867#M22524</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I&amp;nbsp;am doing a cluster analysis with 10 continuous variables and 3 categorical variables.&amp;nbsp;Instead of converting&amp;nbsp;categorical variables into dummies, I am thinking of creating distance matrix&amp;nbsp;using "PROC DISTANCE".&lt;/P&gt;&lt;P&gt;1) Calculate 3 sets of distance&amp;nbsp;matrix and each set contains the distance between one&amp;nbsp;categorical variable(id category_var1)&amp;nbsp;and 10 continuous variables(var interval(continuous&amp;nbsp;_var1-continuous10)&amp;nbsp;&lt;/P&gt;&lt;P&gt;2) then merge 3 sets of distance matrix back with the values of 10 continuous&amp;nbsp;variables&lt;/P&gt;&lt;P&gt;3) Standardize them and use&amp;nbsp;standardized&amp;nbsp;variables&amp;nbsp;as the new variables in "PROC CLUSTER" or "PROC FASTCLUS"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Question, Dose the logic make sense to you, particularly step 1&amp;nbsp;? Thank you.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2018 16:42:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428867#M22524</guid>
      <dc:creator>lionking19063</dc:creator>
      <dc:date>2018-01-18T16:42:34Z</dc:date>
    </item>
    <item>
      <title>Re: MIxed type of variables in cluster analaysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428911#M22526</link>
      <description>&lt;P&gt;Instead, you could get clusters from continuous_var1-continuous_var10 and test for a relationship&amp;nbsp;between those clusters and&amp;nbsp;your categories with proc freq.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2018 20:16:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428911#M22526</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2018-01-18T20:16:54Z</dc:date>
    </item>
    <item>
      <title>Re: MIxed type of variables in cluster analaysis</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428914#M22527</link>
      <description>&lt;P&gt;You are right. However, I really want to test the effects of categorical variables along with other continuous variables at the same time. Thank you for your response.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2018 20:28:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/MIxed-type-of-variables-in-cluster-analaysis/m-p/428914#M22527</guid>
      <dc:creator>lionking19063</dc:creator>
      <dc:date>2018-01-18T20:28:05Z</dc:date>
    </item>
  </channel>
</rss>

