shivkr
Fluorite | Level 6

Hi All,

 

In which scenarios should we use PROC SQL rather than the DATA step when merging two datasets? If one dataset has 2 million records and the other has only 200, which method should I use to join these two tables?

 

Thanks,

 

 

 

2 REPLIES
Astounding
PROC Star

The choice usually depends on the result you want:

 

  • Is there a one-to-one match, a many-to-one match, or a many-to-many match?
  • Do you want matches only, or do you want some (or all) of the mismatches?
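To make the second question concrete, here is a minimal DATA step sketch of how the IN= data set option separates matches from mismatches. The dataset names BIG and SMALL and the key variable ID are hypothetical, and the sketch assumes ID is unique in SMALL (a many-to-one match).

```sas
/* Hypothetical inputs: BIG (about 2 million rows) and SMALL (about 200 rows), */
/* both containing a key variable named ID.                                    */
proc sort data=big;   by id; run;
proc sort data=small; by id; run;

data matches_only all_big;
  merge big(in=in_big) small(in=in_small);
  by id;
  /* Rows whose ID is present in both inputs (comparable to an inner join) */
  if in_big and in_small then output matches_only;
  /* Every BIG row, matched or not (comparable to a left join) */
  if in_big then output all_big;
run;
```

Note that MERGE with BY needs both inputs sorted (or indexed) by the key, so the sort of the large table is part of the cost.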

It might also depend on your skill with the DATA step vs. SQL.

 

The bottom line is you have to describe the inputs and outputs at least a little bit.

Reeza
Super User

There's no hard and fast rule. And the choice isn't only between SQL and the DATA step: hash objects and formats can also be options for lookups.
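As one illustration of the hash option, the sketch below loads the small table into memory and looks up each row of the large one without sorting either table. The names BIG, SMALL, ID and LABEL are hypothetical. It assumes LABEL exists only in SMALL and that ID has the same type and length in both tables.

```sas
data joined_hash;
  /* Never executes; at compile time it adds SMALL's variables to the PDV */
  if 0 then set small(keep=id label);
  if _n_ = 1 then do;
    /* Load the small lookup table into memory once */
    declare hash h(dataset: 'small');
    h.defineKey('id');
    h.defineData('label');
    h.defineDone();
  end;
  set big;
  /* FIND returns 0 on a match and copies LABEL into the PDV;           */
  /* otherwise clear LABEL so a value from a previous row is not kept.  */
  if h.find() ne 0 then call missing(label);
run;
```

This keeps every row of BIG. To keep matches only, the last statement could be replaced with a subsetting `if h.find() = 0;`.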

 

In general, a many-to-many merge is often best done in SQL. Everything else has multiple options.
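For comparison, a PROC SQL join might look like the sketch below. The names BIG, SMALL, ID and LABEL are hypothetical. When a key value repeats in both tables, SQL returns every combination of the matching rows, which a DATA step MERGE does not do.

```sas
proc sql;
  create table joined_sql as
  select b.*, s.label
  from big as b
       inner join small as s
       on b.id = s.id;   /* no prior sorting of either table is required */
quit;
```

Changing `inner join` to `left join` would keep the BIG rows that have no match in SMALL.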
