<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Variable reduction in an unsupervised dataset in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193593#M2448</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey Minal,&lt;/P&gt;&lt;P&gt;1000 inputs do not seem like a lot, so I think you are good to use the Cluster or HPCluster nodes just on those inputs. I am not very clear on what are you planning to do with the zip codes. Were you planning to run a cluster node on your 1000 inputs and then compare those clusters to your zip codes? Or what was your plan?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You can use the Variable Cluster and the Principal Component nodes in Enterprise Miner for dimension reduction but I am not sure that you need that.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Good luck!&lt;/P&gt;&lt;P&gt;-Miguel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 01 Jul 2015 16:47:30 GMT</pubDate>
    <dc:creator>M_Maldonado</dc:creator>
    <dc:date>2015-07-01T16:47:30Z</dc:date>
    <item>
      <title>Variable reduction in an unsupervised dataset</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193592#M2447</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;BR /&gt;Hello Everyone,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; I have a dataset which has about 1000 variables (all are numerical) and is unsupervised(has no target variable). It has a column "zipcode" and my goal is to form meaningful clusters based on this dataset to analyze the association between the zip codes . I was looking to reduce the number of variables (dimensionality reduction) so that I can pass the reduced dataset to PROC Varclus . Is there any effective Procedure for dimensionality reduction for unsupervised datasets? I am using Enterprise Miner and Enterprise Guide. Any related&amp;nbsp; response would be of great help.&amp;nbsp; Thankyou!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 Jul 2015 14:43:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193592#M2447</guid>
      <dc:creator>MinalMMurkhande</dc:creator>
      <dc:date>2015-07-01T14:43:43Z</dc:date>
    </item>
    <item>
      <title>Re: Variable reduction in an unsupervised dataset</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193593#M2448</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey Minal,&lt;/P&gt;&lt;P&gt;1000 inputs do not seem like a lot, so I think you are good to use the Cluster or HPCluster nodes just on those inputs. I am not very clear on what are you planning to do with the zip codes. Were you planning to run a cluster node on your 1000 inputs and then compare those clusters to your zip codes? Or what was your plan?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You can use the Variable Cluster and the Principal Component nodes in Enterprise Miner for dimension reduction but I am not sure that you need that.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Good luck!&lt;/P&gt;&lt;P&gt;-Miguel&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 Jul 2015 16:47:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193593#M2448</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2015-07-01T16:47:30Z</dc:date>
    </item>
    <item>
      <title>Re: Variable reduction in an unsupervised dataset</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193594#M2449</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Miguel,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; Thank you for your reply. Yes, I was planning to run the cluster node on 1000 inputs and then compare/map the observations with the respective zip codes. FYI, each observation is identified by a unique zip code. This is the only method that I could guess. Is there any other efficient method or procedure for dimensionality reduction in an unsupervised dataset other than using the Cluster node?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;-Regards,&lt;/P&gt;&lt;P&gt;Minal.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 01 Jul 2015 18:02:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Variable-reduction-in-an-unsupervised-dataset/m-p/193594#M2449</guid>
      <dc:creator>MinalMMurkhande</dc:creator>
      <dc:date>2015-07-01T18:02:44Z</dc:date>
    </item>
  </channel>
</rss>

