<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to use Principal Components Analysis results to form a Wealth Indicator variable with quinti in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/667006#M31803</link>
    <description>&lt;P&gt;So you have reduced many original variables to 9 PCA variables, and the question is how to reduce this to one index? Well, as far as I know, there's no universal method or formula that gets you to the next step. Furthermore, the sign on the PCA vectors is arbitrary, and has no real meaning, so PCA vector 1 could be positive to indicate a high wealth index, or it could be negative to indicate a high wealth index, and there's no way in advance to know which it is. Or it could be that the principal components are completely or nearly completely unrelated to wealth; they don't necessarily have to have anything to do with wealth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Typically, you might want to try to interpret the PCA vectors, gain some understanding of what vector 1 is telling you, and what vector 2 is telling you, &lt;EM&gt;etc&lt;/EM&gt;. by looking to see which variables have the high loadings (either positive or negative) in a given dimension. For example, if dimension 1 has a lot of variables that have big loadings (either positive or negative) that are associated with education, then you could interpret dimension 1 as an education variable. But even after that, how to get a wealth indicator is not clear.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you want a predictive measure of wealth, then something like Partial Least Squares regression would be useful, its analogous to PCA but tries to find dimensions/vectors that are predictive. This assumes you have a Y variable to predict.\&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;I would also need to divide the wealth indicator variable into quintiles, indicating household wealth based on the quintile the household belongs to.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;PROC RANK with the option GROUPS=5&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 05 Jul 2020 14:40:46 GMT</pubDate>
    <dc:creator>PaigeMiller</dc:creator>
    <dc:date>2020-07-05T14:40:46Z</dc:date>
    <item>
      <title>How to use Principal Components Analysis results to form a Wealth Indicator variable with quintiles?</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/666967#M31801</link>
      <description>&lt;P&gt;Hello!&lt;/P&gt;&lt;P&gt;I have run a principal components analysis of various wealth indicator variables such as 'owning a radio', 'type of toilet used by household', etc with the goal of reducing the large amount of variables. I used proc factor with the PCA method which gave me 9 factors with an Eigenvalue &amp;gt;1. My question is, how do I go from 9 factors to one wealth indicator variable? I would also need to divide the wealth indicator variable into quintiles, indicating household wealth based on the quintile the household belongs to. I am still learning SAS, so any advice would be helpful! Here is my code for reference:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;proc factor data = bedlib.prinanalysis outstat = bedlib.prinfactor&lt;BR /&gt;simple scree corr score nfact = 9 method = principal;&lt;BR /&gt;var PIPED_WATER UNDER_WATER OWNED_TOILET OWNED_PIT SHARED_PIT ELECT_ENERGY&lt;BR /&gt;COAL_ENERGY WOOD_ENERGY GAS_ENERGY COWS_ MULES_ GOATS_ PIGS_ CHICKENS_ ANIMALCA_ BICYCLE_ CAR_&amp;nbsp; MOTORCYCLE_ RADIO_ ELECTRIC_ TELEVISI_ REFRIGER_ CELLPHON_ SOLARPAN_ COMPUTER_ STEREO_;&lt;BR /&gt;run;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 05 Jul 2020 00:30:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/666967#M31801</guid>
      <dc:creator>juliagscott</dc:creator>
      <dc:date>2020-07-05T00:30:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to use Principal Components Analysis results to form a Wealth Indicator variable with quinti</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/666978#M31802</link>
      <description>&lt;P&gt;PCA will only identify which combinations of variables account for the greatest variability in your data. How will you interpret a factor containing &lt;EM&gt;0.1 * &amp;lt;number of cows&amp;gt; - 0.02 &amp;lt;number of goats&amp;gt;&lt;/EM&gt; ?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think you would get a far better indicator of wealth by multiplying each variable with the approximate value of the feature that it represents, such as &lt;EM&gt;&amp;lt;number of televisions&amp;gt; * &amp;lt;average value of a television&amp;gt;&lt;/EM&gt;. I understand that some of these values might be difficult to estimate, but I would suggest to omit those from your indicator as most of them must be correlated anyway.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Such an indicator would be much easier to interpret and justify.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I hope this helps.&lt;/P&gt;</description>
      <pubDate>Sun, 05 Jul 2020 02:56:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/666978#M31802</guid>
      <dc:creator>PGStats</dc:creator>
      <dc:date>2020-07-05T02:56:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to use Principal Components Analysis results to form a Wealth Indicator variable with quinti</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/667006#M31803</link>
      <description>&lt;P&gt;So you have reduced many original variables to 9 PCA variables, and the question is how to reduce this to one index? Well, as far as I know, there's no universal method or formula that gets you to the next step. Furthermore, the sign on the PCA vectors is arbitrary, and has no real meaning, so PCA vector 1 could be positive to indicate a high wealth index, or it could be negative to indicate a high wealth index, and there's no way in advance to know which it is. Or it could be that the principal components are completely or nearly completely unrelated to wealth; they don't necessarily have to have anything to do with wealth.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Typically, you might want to try to interpret the PCA vectors, gain some understanding of what vector 1 is telling you, and what vector 2 is telling you, &lt;EM&gt;etc&lt;/EM&gt;. by looking to see which variables have the high loadings (either positive or negative) in a given dimension. For example, if dimension 1 has a lot of variables that have big loadings (either positive or negative) that are associated with education, then you could interpret dimension 1 as an education variable. But even after that, how to get a wealth indicator is not clear.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you want a predictive measure of wealth, then something like Partial Least Squares regression would be useful, its analogous to PCA but tries to find dimensions/vectors that are predictive. This assumes you have a Y variable to predict.\&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;I would also need to divide the wealth indicator variable into quintiles, indicating household wealth based on the quintile the household belongs to.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;PROC RANK with the option GROUPS=5&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 05 Jul 2020 14:40:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/How-to-use-Principal-Components-Analysis-results-to-form-a/m-p/667006#M31803</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2020-07-05T14:40:46Z</dc:date>
    </item>
  </channel>
</rss>

