<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Weighting the variables to be used in Clustering in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/446307#M6810</link>
    <description>&lt;P&gt;Dears,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kindly note that I need to segment the customers based on behavioral monthly sales and I have 7 variables for clustering and segmenting the customers but some variables are higher than others on importance and weights.This means that if I have Variable called &lt;U&gt;&lt;STRONG&gt;Total_sales_per_month&lt;/STRONG&gt;&lt;/U&gt; and&amp;nbsp; this variable has 40% weights (to contribute with 40% in clustering and decisions).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I need to know how to apply this in SAS Enterprise miner or sas procedures, does this method correctly or there another way in machine learning with the same idea .&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please support me to resolve and satisfy the business needs.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Mar 2018 18:55:11 GMT</pubDate>
    <dc:creator>husseinmazaar</dc:creator>
    <dc:date>2018-03-16T18:55:11Z</dc:date>
    <item>
      <title>Weighting the variables to be used in Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/446307#M6810</link>
      <description>&lt;P&gt;Dears,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Kindly note that I need to segment the customers based on behavioral monthly sales and I have 7 variables for clustering and segmenting the customers but some variables are higher than others on importance and weights.This means that if I have Variable called &lt;U&gt;&lt;STRONG&gt;Total_sales_per_month&lt;/STRONG&gt;&lt;/U&gt; and&amp;nbsp; this variable has 40% weights (to contribute with 40% in clustering and decisions).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I need to know how to apply this in SAS Enterprise miner or sas procedures, does this method correctly or there another way in machine learning with the same idea .&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Please support me to resolve and satisfy the business needs.&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Mar 2018 18:55:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/446307#M6810</guid>
      <dc:creator>husseinmazaar</dc:creator>
      <dc:date>2018-03-16T18:55:11Z</dc:date>
    </item>
    <item>
      <title>Re: Weighting the variables to be used in Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/448784#M6832</link>
      <description>&lt;P&gt;Hello husseinmazaar-&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When computing distances, variables that have a larger variance have a greater contribution to the distance computation.&amp;nbsp; For that reason, variables are often standardized so that all variables have equal importance.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In your case, you might choose to standardize all variables except total_sales_per_month so that they have equal variance, and then assign a different and larger variance to total_sales_per_month.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Run PROC STDIZE (a SAS/STAT procedure) twice for the scenario that you describe.&amp;nbsp; Use one run to standardize all variables to have the same variance (standard deviation, scale).&amp;nbsp; Use the second run to specify a MULT= value for just the total_sales_per_month variable so that it has a larger scale.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://support.sas.com/documentation/onlinedoc/stat/" target="_blank"&gt;https://support.sas.com/documentation/onlinedoc/stat/&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When you run your cluster analysis, be sure to turn OFF any automatic standardization so that your custom standardization is used.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Have a great week!&lt;/P&gt;</description>
      <pubDate>Mon, 26 Mar 2018 19:11:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/448784#M6832</guid>
      <dc:creator>MikeStockstill</dc:creator>
      <dc:date>2018-03-26T19:11:34Z</dc:date>
    </item>
    <item>
      <title>Re: Weighting the variables to be used in Clustering</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/449595#M6842</link>
      <description>&lt;P&gt;Thanks so much.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Mar 2018 08:21:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Weighting-the-variables-to-be-used-in-Clustering/m-p/449595#M6842</guid>
      <dc:creator>husseinmazaar</dc:creator>
      <dc:date>2018-03-29T08:21:28Z</dc:date>
    </item>
  </channel>
</rss>

