<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: proc varclus? in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12075#M1556</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;VARCLUS is related to principle/centroid components, thus can be affected by the scale on which the variables are measured.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;OL start="1"&gt;&lt;LI&gt;Since you specified the CENTROID option, these are unweighted averages of the standardized variables (stated in documentation), you don't need to pre-standardize them.&amp;nbsp; To convince yourself, you can standardize them yourself and then run VARCLUS to see if the result is any different.&lt;/LI&gt;&lt;LI&gt;The easiest way to specify all numeric variables is "var _numeric_;" which is a list of all numeric variables.&amp;nbsp; You can use the drop= option on the input dataset to remove a few variables if necessary "proc varclus data=xxx(drop=v1 v2 v3) centroid; var _numeric_;" if this saves you typing.&lt;/LI&gt;&lt;/OL&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 20 Sep 2011 15:36:13 GMT</pubDate>
    <dc:creator>DLing</dc:creator>
    <dc:date>2011-09-20T15:36:13Z</dc:date>
    <item>
      <title>proc varclus?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12074#M1555</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P class="yiv38135044MsoNormal"&gt;Hi Collegues,&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;I have 750 variables related to customers’ transaction behavior, account balances etc.&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;I have reduced the number of variables using proc varclus. Then the representative variables in each cluster&amp;nbsp; were inputted into proc logistic to identify predictors that are associated with acquisition of bank products.&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;I have two specific questions.&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoListParagraphCxSpFirst"&gt;&lt;SPAN&gt;1.&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;Should I standardize the input variables separately before inputting into proc varclus?&lt;/P&gt;&lt;P class="yiv38135044MsoListParagraphCxSpLast"&gt;&lt;SPAN&gt;2.&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;I used the following statements. Is there any method that I can use so that I do not need to list down the name of all 750 variables?&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;proc varclus data=temp.dataset&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; outtree=tree&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; centroid&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; maxclusters=25;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;var&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; a&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; b&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; c&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; d&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; e..............................................name of 750th variable ;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;run;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;Would really appreciate any ideas.&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;&lt;/P&gt;&lt;P class="yiv38135044MsoNormal"&gt;Mirisage&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 20 Sep 2011 02:42:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12074#M1555</guid>
      <dc:creator>Fernando</dc:creator>
      <dc:date>2011-09-20T02:42:56Z</dc:date>
    </item>
    <item>
      <title>Re: proc varclus?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12075#M1556</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;VARCLUS is related to principle/centroid components, thus can be affected by the scale on which the variables are measured.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;OL start="1"&gt;&lt;LI&gt;Since you specified the CENTROID option, these are unweighted averages of the standardized variables (stated in documentation), you don't need to pre-standardize them.&amp;nbsp; To convince yourself, you can standardize them yourself and then run VARCLUS to see if the result is any different.&lt;/LI&gt;&lt;LI&gt;The easiest way to specify all numeric variables is "var _numeric_;" which is a list of all numeric variables.&amp;nbsp; You can use the drop= option on the input dataset to remove a few variables if necessary "proc varclus data=xxx(drop=v1 v2 v3) centroid; var _numeric_;" if this saves you typing.&lt;/LI&gt;&lt;/OL&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 20 Sep 2011 15:36:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12075#M1556</guid>
      <dc:creator>DLing</dc:creator>
      <dc:date>2011-09-20T15:36:13Z</dc:date>
    </item>
    <item>
      <title>Re: proc varclus?</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12076#M1557</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt; Hi Dling,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you very much for this very useful clarification.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Best regards&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Fernando&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 20 Sep 2011 21:34:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-varclus/m-p/12076#M1557</guid>
      <dc:creator>Fernando</dc:creator>
      <dc:date>2011-09-20T21:34:29Z</dc:date>
    </item>
  </channel>
</rss>

