BookmarkSubscribeRSS Feed
KP12
Fluorite | Level 6

Hi all,

I have a following query, which returns 636302 number of records:

proc sql;

create table application as

select distinct  a.*,  b.App1, b.App2, c.EMP1

from app1 a left join app2 b on (a.Application_Number=b.AA_No) left join app3 c on (a.Application_Number=c.APP_ID);

quit;

I tried rewriting the above query using merge as follows, but this query is returning 635610 number of records:


data application;

merge app1 (IN=a keep=_all_) app2 (in=b keep= App1 App2 AA_No rename=(AA_No=Application_Number))

app3 (in=c keep=EMP1 APP_ID rename=(APP_ID=Application_Number));

by Application_Number;

if a;

RUN;

Can someone please tell me if I'm missing something above in the merge query because of which a difference of 692 records is coming ?

2 REPLIES 2
Patrick
Opal | Level 21

A SAS merge doesn't work exactly the same like a SQL join and you will get different results for many:many relationships.

Please refer to this post and the links provided there

Kurt_Bremser
Super User

If you have 3 records for a key in dataset1, and 2 records for a key in dataset2, a merge in a data step by that key will give you 3 records, the join in SQL will give you 6 (cartesian product).

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 2 replies
  • 1240 views
  • 0 likes
  • 3 in conversation