BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
Davanden
Obsidian | Level 7

This is probably an easy question for people accustomed to SQL programming, but, alas, I am not.

 

Have

  • AHS87: Control, Status, lots of other variables....
  • AHS89: Control, Status weight, lots of other variables...

Want

Linked:  Control, Status87, Status89, weight no other variables

 

In other words, I want to link AHS87 and AHS89 by matching on Control.  I keep the Status variable from each dataset and drop everything else except the weight from AHS89.

 

I could do this in a DATA step easily enough, but I'm trying to understand how to do it using PROC SQL.

 

Thanks.

 

--Dav

1 ACCEPTED SOLUTION

Accepted Solutions
art297
Opal | Level 21

Here is an example similar to your specs:

data ahs87;
  input control status var1-var3;
  cards;
1 1 2 3 4
2 2 4 5 6
3 3 5 6 7
;

data ahs89;
  input control status weight var5-var7;
  cards;
1 9 6 2 3 4
2 8 5 4 5 6
3 7 4 5 6 7
;

proc sql noprint;
  create table want as
    select a.control,a.status as status87,
           b.status as status89,
           b.weight
      from ahs87 a join ahs89 b
        on a.control eq b.control
  ;
quit;

Art, CEO, AnalystFinder.com

 

View solution in original post

4 REPLIES 4
art297
Opal | Level 21

Here is an example similar to your specs:

data ahs87;
  input control status var1-var3;
  cards;
1 1 2 3 4
2 2 4 5 6
3 3 5 6 7
;

data ahs89;
  input control status weight var5-var7;
  cards;
1 9 6 2 3 4
2 8 5 4 5 6
3 7 4 5 6 7
;

proc sql noprint;
  create table want as
    select a.control,a.status as status87,
           b.status as status89,
           b.weight
      from ahs87 a join ahs89 b
        on a.control eq b.control
  ;
quit;

Art, CEO, AnalystFinder.com

 

Davanden
Obsidian | Level 7
Thank you. That was just what I needed.
ballardw
Super User

The type of join to specify depends on how you want non-matched records to be treated.

A simple example

 

proc sql;

   create table want as

   select ahs87.control, ahs87.status as ahs87status, ahs89.status as ahs89status, ahs89.weight

   from ahs87 left join ahs89

         on ahs87.control = ahs89.control

   ;

quit;

 

Keeps all of the records from ahs87 and only brings in the matches from ahs89. I renamed the status variables with a new name to indicate which data set contributed which value. Notice the notation of dataset.variable, that . is IMPORTANT as are the , between variables in on select clause.

Unlike a data step merge with BY the data sets involved do not need to be sorted by the matching varialbes and the variables used in the in ON clause do not have to have the same name. If the value in ahs89 that should correspond to Control in Ahs87 was name Con_89 you could use

on ahs87.control = ah89.con_89

for the match.

Unlike the data step the matching variables

Davanden
Obsidian | Level 7

Thank you for your help.  As it happened, I wanted to keep non-matching cases, as the fact that they don't match tells me something.  I used a full join to accomplish that.

 

--Dav

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 4 replies
  • 995 views
  • 1 like
  • 3 in conversation