Hi Everyone,
I have one dataset with a single variable containing people's names, and another dataset with a list of training courses. Is there a quick way to create a third dataset containing one record for each possible person and training combination? The two datasets do not have a common variable. Keep in mind that we may not know how many of each we have at any time, and the data can change constantly. For example:
Dataset A:
Name
Abby
Bobby
Chris
Dataset B:
Training
Training #1
Training #2
Desired Dataset:
Name Training
Abby Training #1
Abby Training #2
Bobby Training #1
Bobby Training #2
Chris Training #1
Chris Training #2
Any thoughts?
Thanks!
-Phil
It's called a Cartesian join (or cross join).
proc sql;
create table want as
select *
from a,b;
quit;
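If you prefer the join to be spelled out, here is a minimal sketch of the same query using the explicit CROSS JOIN keyword. It assumes the datasets are named a and b with the variables name and training, as in the example data. The ORDER BY is optional and only makes the row order match the desired output. SAS may still write the Cartesian product note to the log, since a cross join is by definition a Cartesian product.

```sas
proc sql;
   /* One row per (name, training) pair; no common variable is needed. */
   create table want as
   select a.name, b.training
   from a cross join b
   order by a.name, b.training;
quit;
```

Because the query does not hard-code any counts, it picks up whatever rows are in a and b each time it runs.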
Thanks! That worked perfectly! It did give me a "NOTE: The execution of this query involves performing one or more Cartesian product joins that can not be optimized." message, but it did seem to create all of the desired records.
data a;
input name $;
cards;
abby
bobby
chris
;
run;
data b;
length training $ 11;
input training &;
cards;
training #1
training #2
;
run;
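/* For each name in A, read every row of B by direct access. */
data c;
set a;
/* NOBS= is assigned at compile time, so it is known before the loop runs. */
do _n_=1 to nobs;
set b nobs=nobs point=_n_;
output;
end;
run;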
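Since the number of names and trainings can change, a quick sanity check is to confirm that the result has exactly (rows in a) times (rows in b) records. The sketch below assumes the dataset names a, b and c used above. The macro variable names n_a, n_b and n_c are made up for illustration, and the TRIMMED option assumes SAS 9.3 or later.

```sas
proc sql noprint;
   /* Store each row count in a macro variable. */
   select count(*) into :n_a trimmed from a;
   select count(*) into :n_b trimmed from b;
   select count(*) into :n_c trimmed from c;
quit;

/* Write the actual and expected counts to the log for comparison. */
%put NOTE: a=&n_a b=&n_b c=&n_c expected=%eval(&n_a * &n_b);
```

With the sample data, 3 names and 2 trainings should give 6 rows in c.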