Teketo
Calcite | Level 5

Hello,

 

I have calculated principal components for hundreds of respondents, using about ten variables, with PROC FACTOR. I am now trying to use these individual-level component scores to compute a single group-level measure for each component.

 

For instance, I have 10 schools with 20 students per school, and I obtained two principal components from the 10 variables.

How can I use these two components for each school?

Can I sum up comp1 and take the average to get one school measure for component one?

And do the same thing for comp2 to get one measure for each school?

 

Component A = (sum of all comp1 for each student in school X) / (Total students in school X)

Component B = (sum of all comp2 for each student in school X) / (Total students in school X)

 

Regards

Teketo

Accepted Solution
PaigeMiller
Diamond | Level 26

How can I use these two components for each school?

That's really up to you. What would you like to learn from the data? You haven't told us.

 

Principal components are often used to help visualize data in lower dimensional space so you can detect patterns or clustering. Is that what you want?

 

Tell us what you want to learn from this data.

 

Can I sum up comp1 and take the average to get one school measure for component one?

And do the same thing for comp2 to get one measure for each school?

 

Component A = (sum of all comp1 for each student in school X) / (Total students in school X)

Component B = (sum of all comp2 for each student in school X) / (Total students in school X)

 

Yes, you can do this. However, taken exactly as you have described it, the question of comparing school averages is a situation where you would use MANOVA rather than principal components.
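
As a rough sketch of both options, assume a hypothetical data set STUDENTS with one row per student, a SCHOOL identifier, and variables X1-X10 (none of these names come from the original post). The first two steps compute the component scores and average them by school; the last step runs a MANOVA on the original variables instead. If the scores come from PROC FACTOR with OUT= and NFACTORS=2, they would be named Factor1 and Factor2 by default rather than Prin1 and Prin2.

```sas
/* Assumed input: WORK.STUDENTS with SCHOOL and X1-X10 (hypothetical names) */

/* Step 1: principal component scores for each student.
   The OUT= data set holds the original data plus Prin1 and Prin2. */
proc princomp data=students out=scores n=2;
   var x1-x10;
run;

/* Step 2: average each component within school.
   NWAY keeps only the per-school rows; _FREQ_ is the row count per school. */
proc means data=scores nway noprint;
   class school;
   var Prin1 Prin2;
   output out=school_means mean=compA compB;
run;

/* Alternative: test for school differences on the original variables.
   NOUNI suppresses the ten univariate ANOVAs. */
proc glm data=students;
   class school;
   model x1-x10 = school / nouni;
   manova h=school;
run;
quit;
```

SCHOOL_MEANS then has one row per school, with compA and compB matching the Component A and Component B formulas above.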

 

So again, we need to know what you want to learn from this data.

--
Paige Miller

Reeza
Super User

You could. I'm assuming the 'groups' are predetermined?

I'm assuming you would graph it first to see whether the distribution makes sense across your groups. I think you're trying to SCORE your data. The PROC SCORE documentation has a fully worked example of how to get the principal component scores for your data.

 



PaigeMiller
Diamond | Level 26




PROC SCORE is not really needed here; one of the output data sets from PROC PRINCOMP (the OUT= data set) will contain the PCA scores. The only time PROC SCORE would be useful is to compute scores for observations that were not in the original data set.
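
To make the distinction concrete, here is a minimal sketch, assuming hypothetical data sets STUDENTS (the observations used in the PCA) and NEW_STUDENTS (observations that were not), both containing variables X1-X10. These names are illustrative only.

```sas
/* Scores for the original observations come straight from OUT=.
   OUTSTAT= saves the means, standard deviations and scoring coefficients. */
proc princomp data=students out=scores outstat=pc_stats n=2;
   var x1-x10;
run;

/* PROC SCORE is only needed for observations that were not in STUDENTS.
   It applies the saved coefficients to the new data. */
proc score data=new_students score=pc_stats out=new_scores;
   var x1-x10;
run;
```

SCORES and NEW_SCORES both contain Prin1 and Prin2, so the new observations end up on the same component scale as the original ones.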

--
Paige Miller
