BookmarkSubscribeRSS Feed
shivkr
Fluorite | Level 6

Hi All,

 

In which senario we should use proc sql over data step while merging two dataset? If I have a dataset with 2 million records and other has only 200 records. which method should I use while joining these two tables?

 

Thanks,

 

 

 

2 REPLIES 2
Astounding
PROC Star

The choice usualy depends on the result you want:

 

  • Is there a one-to-one match, a many-to-one match, or a many-to-many match
  • Do you want matches only, or do you want some (or all) of the mismatches?

It might also depend on your skill with the DATA step vs. SQL.

 

The bottom line is you have to describe the inputs and outputs at least a little bit.

Reeza
Super User

There's no hard and fast rule. And it isn't only between SQL and data step, hash objects and formats can also be options for lookups. 

 

In general, a many to many merge is often best done in SQL. Everything else has multiple options. 

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 2 replies
  • 943 views
  • 0 likes
  • 3 in conversation