Hello,
I have 2 different datasets. Dataset 1 has about 500 observations and 15 variables and Dataset 2 has 7500 observations and 250 variables.
I have to import that 250 variables from Dataset 2 for that 500 observations in Dataset 1 (note 500 observations in dataset 1 are present in dataset 2 by three common identifiers: last name, first name and IDnumber)
If I use a
merge dataset1 dataset2; by firstname lastname IDnumber;
I am thinking I am going to get 7500 observations with 500 of them having that 15 extra variables (-3 common identifiers) from dataset 1.
Any idea how else I could use to merge this and just have a new combined dataset that has 500 observations from Dataset with 15 + 250 variables ?
Your help is much appreciated !!!
You just need an extra statement:
merge dataset1(in=inD1) dataset2; by firstname lastname IDnumber;
if inD1;
PG
Figured it out myself.
Just a simple delete command for a variable thats only present in dataset 1 but not in dataset2 by writing
if variable =' ' then delete;
worked like a charm
Just a little extra thinking that I was lacking when initially thinking about it.
Thanks anyways for folks reading this !
You just need an extra statement:
merge dataset1(in=inD1) dataset2; by firstname lastname IDnumber;
if inD1;
PG
Thanks so much. Thats really good to know !
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.