Hello,
I have 2 different datasets. Dataset 1 has about 500 observations and 15 variables and Dataset 2 has 7500 observations and 250 variables.
I have to import that 250 variables from Dataset 2 for that 500 observations in Dataset 1 (note 500 observations in dataset 1 are present in dataset 2 by three common identifiers: last name, first name and IDnumber)
If I use a
merge dataset1 dataset2; by firstname lastname IDnumber;
I am thinking I am going to get 7500 observations with 500 of them having that 15 extra variables (-3 common identifiers) from dataset 1.
Any idea how else I could use to merge this and just have a new combined dataset that has 500 observations from Dataset with 15 + 250 variables ?
Your help is much appreciated !!!
You just need an extra statement:
merge dataset1(in=inD1) dataset2; by firstname lastname IDnumber;
if inD1;
PG
Figured it out myself.
Just a simple delete command for a variable thats only present in dataset 1 but not in dataset2 by writing
if variable =' ' then delete;
worked like a charm
Just a little extra thinking that I was lacking when initially thinking about it.
Thanks anyways for folks reading this !
You just need an extra statement:
merge dataset1(in=inD1) dataset2; by firstname lastname IDnumber;
if inD1;
PG
Thanks so much. Thats really good to know !
April 27 – 30 | Gaylord Texan | Grapevine, Texas
Walk in ready to learn. Walk out ready to deliver. This is the data and AI conference you can't afford to miss.
Register now and lock in 2025 pricing—just $495!
Still thinking about your presentation idea? The submission deadline has been extended to Friday, Nov. 14, at 11:59 p.m. ET.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.