BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Olga_Shest
Calcite | Level 5

Hello SAS community,

Can somebody suggest how to create a set of all possible pairs of observations (Cartesian product) without using proc sql?

Say I have a data set test1:

data test1;

   do x=1 to 3;

      y=x+10; output;

   end;
run;

I would like to create a data set test2 that contains all possible combinations of x and y:

data test2;

   input x y;

   datalines;

      1  11

      1  12

      1  13

      2  11

      2  12

      2  13

      3  11

      3  12

      3  13

;

run;

1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

Not sure I understand because your examples don't match the simple definition of all possible pairs.

Instead consider if you have dataset X with all possible values of variable X.  Similarly for Y.

So for each observation in X read in all observations in Y.

data want ;

    set x;

    do _n_=1 to nobs;

       set y nobs=nobs point=_n_;

      output;

   end;

run;

View solution in original post

8 REPLIES 8
Haikuo
Onyx | Level 15

Here is one approach:

data test1;

   do x=1 to 3;

      y=x+10; output;

   end;

run;

data test2;

   set test1;

   do i=1 to nobs;

      set test1 (keep=y) point=i nobs=nobs;

output;

    end;

run;

proc print;run;

Regards,

Haikuo

Tom
Super User Tom
Super User

Not sure I understand because your examples don't match the simple definition of all possible pairs.

Instead consider if you have dataset X with all possible values of variable X.  Similarly for Y.

So for each observation in X read in all observations in Y.

data want ;

    set x;

    do _n_=1 to nobs;

       set y nobs=nobs point=_n_;

      output;

   end;

run;

Hima
Obsidian | Level 7

data test1;
do x=1 to 3;
y=x+10; output;
end;
run;

data test2;
set test1;
drop y;
run;

proc print noobs; run;

data test3;
set test1;
drop x;
run;

proc print noobs; run;

proc sql;
select x,y from test2, test3;
quit;

Output:

                      x         y
                                       ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ
                                              1        11
                                              1        12
                                              1        13
                                              2        11
                                              2        12
                                              2        13
                                              3        11
                                              3        12
                                              3        13


Olga_Shest
Calcite | Level 5

Thank you Hima. Your approach requires proc sql though...

Haikuo
Onyx | Level 15

Just for fun, here is a hash approach,

data test2 (drop=_:);

  if _n_=1 then do;

    set test1(obs=1);

dcl hash h(dataset: 'test1', ordered: 'a');

h.definekey('y');

h.definedata('y');

h.definedone();

dcl hiter hi('h');

   end;

   set test1;

      do _rc=hi.first() by 0 while (_rc=0);

output;

_rc=hi.next();

end;

run;

Olga_Shest
Calcite | Level 5

Thank you Haikuo. Would you care to elaborate what are advantages of the hash approch you demonstrated over the "stacking y's for all x's" approach that you and Tom posted?

Haikuo
Onyx | Level 15

Hash in general is more efficient for saving I/O time by process records completely in memory. I can't speak for this case, you will have to do a benchmark.

Haikuo

Olga_Shest
Calcite | Level 5

Thank you everyone for posting!

hackathon24-white-horiz.png

Join the 2025 SAS Hackathon!

Calling all data scientists and open-source enthusiasts! Want to solve real problems that impact your company or the world? Register to hack by August 31st!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 8 replies
  • 4890 views
  • 6 likes
  • 4 in conversation