Principal components for 10000 variable dataset

Reply
New Contributor
Posts: 4

Principal components for 10000 variable dataset

Hi all,

I have a dataset which i need to run a clustering algorithm on. First step is to reduce the dataset's dimensions as it has 10400 variables. I use PROC PRINCOMP but the resulting files are so huge they wont open in SAS Results Viewer

Does anyone have a suggestion on how i can access the eigenvalues/eigenvectors tables resulting from the PRINCOMP procedure.

Thanks in advance

Super Contributor
Posts: 340

Re: Principal components for 10000 variable dataset

Not sure if I understood your question properly, but if you are simply looking for a way to check where the result data is use "ods trace on" to find out the name (SAS(R) 9.2 Output Delivery System: User's Guide) and "ods output" to get the data.

New Contributor
Posts: 4

Re: Principal components for 10000 variable dataset

Posted in reply to user24feb

Thank you. I will try your suggestions. Sorry my question wasnt very clear. I think due to the size of the result data, SAS output just fails to open the results which are over 1GB is size.

Respected Advisor
Posts: 2,655

Re: Principal components for 10000 variable dataset

Try:

ods listing close;

ods html close;

ods output eigenvalues=eigenvalues eigenvectors=eigenvectors;

proc princomp ....

run;

ods listing;

ods html;

You can dispense with the html stuff if you are doing this in batch.

Steve Denham

New Contributor
Posts: 4

Re: Principal components for 10000 variable dataset

Posted in reply to SteveDenham

Thanks Steve,

This worked worked. Managed to read the eigenvectors/eigenvalues data from the temp library.

Much appreciated,

R

Super User
Posts: 10,041

Re: Principal components for 10000 variable dataset

or try Cronbach's coefficient Alpha of proc corr , Check the example 5 in its documentation.

or try proc varclus

Trusted Advisor
Posts: 1,228

Re: Principal components for 10000 variable dataset

Do you want to run clustering algorithm on original variables or on principal components?

New Contributor
Posts: 4

Re: Principal components for 10000 variable dataset

@ stat@sas ... i want to run clustering on principal components

Just want to say thanks to you all for your responses. I have managed to access the proc princomp results now.

Many thanks guys,

R

Ask a Question
Discussion stats
  • 7 replies
  • 288 views
  • 3 likes
  • 5 in conversation