Statistical Procedures

carbs · Posted 04-01-2012 05:37 AM

Hi,

I have a data including 20 countries' observations for four variables, x1,x2,x3 and x4. I would like to get mean values for the correlations between countries for pairs x1&x2, x1&x3,1&4,2&3,2&4,3&4. I have all the data in one file, and I know proc corr can be used in calculating the correlations, but didn't figure out how to be able to modify the code so that I'd get it to calculate the mean correlations like suggested?

Doc_Duke · Posted 04-02-2012 03:00 PM

Actually, I am not sure that your question is well specified. You have twenty data points with 4 values for each (20 rows and 4 columns). If you do the initial correlations that you specify, then that is ACROSS all countries, not BETWEEN any pair of them.

carbs · Posted 04-02-2012 05:36 PM

Sorry for the inaccuracy in my question.. Yes I meant across all countries..

carbs · Posted 04-17-2012 02:51 PM

Any ideas still?

Tom · Posted 04-17-2012 03:36 PM

Post some actual example data so that someone can try to understand the question.

carbs · Posted 04-17-2012 03:54 PM

I attached a data sample.. There's four variables, and each country has their own values for those variables.. Now I'd like to calculate mean correlations for those variables.. By mean correlations I mean what is the average correlation between two of the variables across 20 countries.. Hope this helps

Reeza · Posted 04-17-2012 04:13 PM

Can you post what you'd expect the output to look like? I'm still a bit confused on the 'across 20 countries' part.

You'll definitely need to change your data structure as well, do you already have that done?

carbs · Posted 04-17-2012 04:35 PM

Attached a table that represents the results I want to have as well (the average part).. Haven't yet modified my data, so it's just in the form than in the excel file attached..

PGStats · Posted 04-17-2012 04:18 PM

I didn't open your Excel file for security reasons and the usefulness of mean correlation is foreign to me but the following example might still show you how to calculate just that : the mean of all possible correlations in a set of variables :

data test;
array x{4};
do country = 1 to 20;
do i = 1 to 4;
X{i} = rannor(-1); /* Random numbers for the test */
end;
output;
end;
run;

proc corr data=test outp=testc;
var X:;
run;

proc transpose data=testc(where=(_TYPE_="CORR"))
name=VAR
out=testt(where=(VAR>_NAME_))
prefix=CORR;
var X:;
by _NAME_ notsorted;
run;

title "Mean correlation";
proc sql;
select mean(CORR1) as meanCorr from testt;
drop table testc, testt;
quit;

PG

Ksharp · Posted 04-18-2012 04:40 AM

How about:

proc import datafile='c:\sasforums.xls' out=have dbms=excel replace ;getnames=yes ;run;
data _null_;
dsid=open("have","i");
num=attrn(dsid,"nvars");
do i=2 to num by 5;
 n+1;
 name=catx(',',varname(dsid,1),varname(dsid,i),varname(dsid,i+1),varname(dsid,i+2),varname(dsid,i+3),varname(dsid,i+4) );
 call symputx(cats('name',n),name);
end;
  call symputx('n',n);
rc=close(dsid);
run;
%macro stack;
proc sql;
create table want as 
%do i=1 %to &n;
 select &&name&i from have
 %if &i ne &n %then %do; union all  %end;
%end;
 ;
quit;
%mend stack;

 %stack
proc sort data=want;by country;run;
proc corr data=want outp=want1(where=(_TYPE_="CORR"))  noprint;
  by country;
  var _numeric_;
run;
data want2(keep=country _name_ name corr);
 set want1;
 array _a{*} _numeric_;
 do i=1 to dim(_a);
  if _a{i}=1 then leave;
  name=vname(_a{i});corr=_a{i};output;
 end;
run;
proc sql noprint;
 select distinct catt('want2(where=(country="',country,'") rename=(corr=corr_',scan(country,-1),'))') into : list separated by ' '
  from want2;
quit;

proc sort data=want2;by country _name_ name;run;
data want3(drop=country);
 merge &list ;
 by _name_ name;
 mean_corr=mean(of corr_:);
run;

Ksharp

carbs · Posted 04-22-2012 03:29 AM

PGstats and Ksharp, thanks!

Statistical Procedures

mean correlations across 80 variables

mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Re: mean correlations across 80 variables

Simulate Correlated Multivariate Binary Variables

[SAS 활용 노하우] 상관계수(Coefficient of correlation)

Correlation

Correlate variable from two dataset

Correlation between interval variables and binary variables

Follow Us

What is...

Statistical Procedures

Join us for our biggest event of the year!

Follow Us

What is...