<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Grouping similar competitors in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106464#M5589</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You will probably find that restructuring your data to: Origination, Destination, Airline and Price lends itself to better analysis in general.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 28 Aug 2012 19:15:13 GMT</pubDate>
    <dc:creator>ballardw</dc:creator>
    <dc:date>2012-08-28T19:15:13Z</dc:date>
    <item>
      <title>Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106463#M5588</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Suppose we want to look at which airlines have similar pricing strategies. The data set looks like this:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Variables: Flight Origination, Flight Destination, Airline1 Price, Airline2 Price, ....Airline10 Price.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Data:&lt;/P&gt;&lt;P&gt;Origination: A, Destination: B, Airline1 Price=100, Airline2 = 120, ...., Airline10=95&lt;/P&gt;&lt;P&gt;Origination: A, Destination: C, Airline1 Price=500, Airline2 = 450, ...., Airline10=505&lt;/P&gt;&lt;P&gt;......&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I wonder if I can use PROC VARCLUS to accomplish this objective. But as I read, it seems PROC VARCLUS is not used for this purpose.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Aug 2012 18:38:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106463#M5588</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-08-28T18:38:22Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106464#M5589</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You will probably find that restructuring your data to: Origination, Destination, Airline and Price lends itself to better analysis in general.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 28 Aug 2012 19:15:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106464#M5589</guid>
      <dc:creator>ballardw</dc:creator>
      <dc:date>2012-08-28T19:15:13Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106465#M5590</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;More info:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Expected outcome goes like this:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Airline1, Airline5, Airline10 have the same pricing strategies&lt;/P&gt;&lt;P&gt;Airline2, Airline3, Airline6 have the same pricing strategies&lt;/P&gt;&lt;P&gt;....&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;What I mean by the same pricing strategies is when Airline1 prices high on a origination and destination pair, Airline5 and Airline10 also prices high.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Aug 2012 17:41:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106465#M5590</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-08-29T17:41:11Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106466#M5591</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;If you restructure your data the way that ballardw says, then you can apply the various clustering procedures to get to the type of answer that you are looking for.&amp;nbsp; This looks like a classic CART (Classification and regression tree) problem.&amp;nbsp; SAS Enterprise Miner has a CART procedure.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You will probably need some additional information for a complete analysis (distance, number of stops, time of day, lead time to purchase, etc.).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Doc Muhlbaier&lt;/P&gt;&lt;P&gt;Duke&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Aug 2012 18:19:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106466#M5591</guid>
      <dc:creator>Doc_Duke</dc:creator>
      <dc:date>2012-08-29T18:19:43Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106467#M5592</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;The example is just intended for illustrating my question. I don't plan to go into a full analysis of airline industry. I don't see restructuring the data in ballarsw's way can solve my problem.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #333333; font-family: Verdana, Arial, Tahoma, Calibri, Geneva, sans-serif; background-color: #fafafa;"&gt;Another point is I want to group variables, not observations.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #333333; font-family: Verdana, Arial, Tahoma, Calibri, Geneva, sans-serif; background-color: #fafafa;"&gt;Can I create a correlation matrix of Airline1 to Airline10 and figure out the groupings?&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 29 Aug 2012 19:01:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106467#M5592</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-08-29T19:01:58Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106468#M5593</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In principle, VARCLUS seems fine for your task, as long as every airline offers a flight between every O-D pair. O-D pairs with missing values will however be deleted from the analysis.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;An alternative would be to transpose your dataset and use proc DISTANCE to get a distance matrix with perhaps DCORR as a distance measure. Then you could use that distance matrix as input into proc CLUSTER or proc MODECLUS to get airline clusters. You could also simply feed your distance matrix into proc MDS and get a 2-D representation of the distances.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;PG&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Aug 2012 01:03:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106468#M5593</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2012-08-30T01:03:03Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106469#M5594</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks for the suggestion, PG.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I also would like to find out who price in the different direction than each other.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;So two patterns are interested:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;When A prices high/low, B prices high/low most of time =&amp;gt; group A &amp;amp; B together&lt;/P&gt;&lt;P&gt;When A prices high/low, B prices low/high most of time. =&amp;gt; group A &amp;amp; B together&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I think only correlation matrix fits the bill.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Aug 2012 03:47:17 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106469#M5594</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-08-30T03:47:17Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106470#M5595</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I don't think so.&lt;/P&gt;&lt;P&gt;Correlation matrix is suited for those variables which are independent for each everyone.&lt;/P&gt;&lt;P&gt;There are some Partial correlation for one single variable. That means someone variable will be associated with more than one variables. That is the reason why we should use proc varclus to consider such a question, it has considered the Partial Correlation .&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Ksharp&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Aug 2012 05:02:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106470#M5595</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2012-08-30T05:02:08Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106471#M5596</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Ksharp,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Pattern1: When A prices high/low, B prices high/low most of time =&amp;gt; group A &amp;amp; B together&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Pattern2: When A prices high/low, B prices low/high most of time. =&amp;gt; group A &amp;amp; B together&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P&gt;Can proc varclus group variables with Pattern2 together? Or it only can group variables with Pattern1?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Aug 2012 05:18:28 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106471#M5596</guid>
      <dc:creator>kurofufu</dc:creator>
      <dc:date>2012-08-30T05:18:28Z</dc:date>
    </item>
    <item>
      <title>Re: Grouping similar competitors</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106472#M5597</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;These two Patterns you are talking about is linear correlation (i.e. Pearson correlated coefficient ) which conform to Normal distribution (i.e. independent for each other). you can use correlation matrix, once you make sure that they are non-correlated (i.e. each airline make its decision is independent ) .&lt;/P&gt;&lt;P&gt;proc varclus is implementing multi-variables analysis, which is nothing with linear correlation. he use principal component analysis.&lt;/P&gt;&lt;P&gt;The first principal component is a weighted average of the variables that explains as much variance&lt;/P&gt;&lt;P&gt;as possible.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For example: if A and B belong to a group that means A and B can explain most variance of either the correlation or the covariance matrix .&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;KSharp&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 30 Aug 2012 06:29:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Grouping-similar-competitors/m-p/106472#M5597</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2012-08-30T06:29:15Z</dc:date>
    </item>
  </channel>
</rss>

