Programming the statistical procedures from SAS

How to use individual principal component scores to compute group scores


Hello,

 

I have calculated principal components with PROC FACTOR for several hundred respondents, using about ten variables on average. I am now trying to use these individual principal component scores to compute a single group-level measure for each component.

 

For instance, I have 10 schools with 20 students per school, and I extracted two principal components from the 10 variables.

How can I use these two components for each school?

Can I sum comp1 over the students in a school and take the average to get one school-level measure for component one?

Can I then do the same for comp2 to get a second measure for each school?

 

Component A = (sum of all comp1 for each student in school X) / (Total students in school X)

Component B = (sum of all comp2 for each student in school X) / (Total students in school X)

 

Regards

Teketo


Accepted Solutions
Solution
‎04-27-2018 04:10 AM
Respected Advisor
Posts: 3,066

Re: How to use individual principal component scores to compute group scores


@Teketo wrote:

How can I use these two components for each school?

That's really up to you. What would you like to learn from the data? You haven't told us.

 

Principal components are often used to help visualize data in a lower-dimensional space so that you can detect patterns or clustering. Is that what you want?

 

Tell us what you want to learn from this data.

 

@Teketo wrote:

Can I sum up comp1 to get one school measure for component one and take the average?

Do the same thing for comp2 and get one measure for each school?

Component A = (sum of all comp1 for each student in school X) / (Total students in school X)

Component B = (sum of all comp2 for each student in school X) / (Total students in school X)

 

Yes, you can do this. However, taken exactly as you have described it, the question is one of comparing school averages, and that is a situation where you would use MANOVA rather than principal components.
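The averaging described above is just a group mean of each component score. A minimal, language-neutral sketch in Python, assuming one row per student with a school label and two component scores; the data values and the names `scores` and `school_means` are made up for illustration. In SAS, a PROC MEANS step with a CLASS statement on the school variable would produce the same per-school means.

```python
from collections import defaultdict

# Made-up illustrative rows: (school, comp1, comp2), one row per student.
scores = [
    ("A", 1.0, -0.5),
    ("A", 0.0, 0.5),
    ("B", -1.0, 2.0),
    ("B", -2.0, 1.0),
    ("B", 0.0, 0.0),
]

def school_means(rows):
    """Return {school: (mean of comp1, mean of comp2)}."""
    # Running totals per school: [sum of comp1, sum of comp2, student count].
    totals = defaultdict(lambda: [0.0, 0.0, 0])
    for school, c1, c2 in rows:
        t = totals[school]
        t[0] += c1
        t[1] += c2
        t[2] += 1
    # Component A and Component B from the question: sum / number of students.
    return {s: (t[0] / t[2], t[1] / t[2]) for s, t in totals.items()}

means = school_means(scores)
for school, (a, b) in sorted(means.items()):
    print(school, a, b)
```

Note that this only summarizes each school; it does not test whether the schools differ, which is where MANOVA comes in.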

 

So again, we need to know what you want to learn from this data.

--
Paige Miller



All Replies
Super User
Posts: 23,778

Re: How to use individual principal component scores to compute group scores

You could. I'm assuming the 'groups' are predetermined?

I'm assuming you would graph it first to see whether the distribution makes sense across your groups. I think you're trying to SCORE your data. The PROC SCORE documentation has a fully worked example of how to get the principal component scores for your data.

 




 

Respected Advisor
Posts: 3,066

Re: How to use individual principal component scores to compute group scores





PROC SCORE is not really needed here; the OUT= data set from PROC PRINCOMP already contains the PCA scores. The only time PROC SCORE would be useful is to compute scores for observations that were not in the original data set.
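To illustrate what scoring an observation that was not in the original data set involves: you reuse the means, standard deviations, and component weights saved from the original fit rather than refitting. A rough Python sketch, assuming a correlation-based PCA on two variables; every number and name here (`means`, `stds`, `weights`, `score`) is hypothetical, and this is not a description of how PROC SCORE is implemented.

```python
# Hypothetical statistics saved from the original fit (two variables).
means = [50.0, 10.0]
stds = [10.0, 2.0]

# Hypothetical component weights, one row per component (made-up values
# chosen to be orthonormal; real weights come from the fitted PCA).
weights = [
    [0.6, 0.8],   # comp1
    [0.8, -0.6],  # comp2
]

def score(observation):
    """Standardize with the saved means/stds, then apply the saved weights."""
    z = [(x - m) / s for x, m, s in zip(observation, means, stds)]
    return [sum(w * zj for w, zj in zip(row, z)) for row in weights]

# A student who was not part of the original analysis.
new_student = [60.0, 12.0]
comp1, comp2 = score(new_student)
print(comp1, comp2)
```

For students who were in the original analysis, none of this is needed, because their scores are already in the output data set.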

--
Paige Miller