I have data set with 10 binary variables (var1-var10), and about 300 observations. I want to output 9 data sets: Data set 1: all combinations of the 10 variables taken 2 variables at a time (10 choose 2), so 45 rows Data set 2: 10 choose 3, so 120 rows .... Data set 9: 10 choose 10, so just 1 row Each data set will include all var1-var10, and percent variable that shows how many observations =1 for each combination in data. For example (for data set of 10 choose 2): var1 var2 var3 .... var10 Percent 1 1 3% (percent of 300 observations where var1=1 and var2=1) 1 1 70% 1 1 5% .... showing all possible combinations of 10 choose 2 Best way to do this, and can be applied to larger number of variables that would produce larger data sets (i.e. 30 variables, taken 15 at a time = >150 million obs)?
... View more