Dear SAS community members, I am working in scientific research. I would like to run simultaneously correlations between one variable (eg blood cholesterol) and thousand of others (names of thousand genes). The name of the variables look like this (and continues up to some thousands): TC01000005_hg_1, TC01000006_hg_1, TC01000008_hg_1, TC01000009_hg_1, TC01000010_hg_1, TC01000011_hg_1, TC01000012_hg_1, TC01000013_hg_1, TC01000014_hg_1, TC01000015_hg_1, TC01000016_hg_1, TC01000017_hg_1, TC01000018_hg_1, TC01000019_hg_1 My first question is how I should type the command so that I can include all those thousand variables. I have seen a syntax like the following; proc corr data=myData; var Var1; with var2-var99; run; But how should I transform it in order to include my type of variables? My second question is how I should type the "BEST" command so that, after running the thousands correlations, I could have a list of the top 50 results. This should include the 50 variable names with the highest (or lowest) R value and the lowest p value. My third question, is it possible to run this type of correlations (one variable against 30.000 variables) in SAS university edition. I have assigned 7GB out of 8 total GB RAM for the VM Box. Thank you so much in advance!! Regards Apo
... View more