Solved: Re: How to match and merge a data

hjjijkkl · Posted 02-22-2021 10:15 AM

I have two data sets with the same variable name. I want to merge them but keep the values in two separate columns. Is it possible to match and merge?

First data set:

ID
2
3
5

Second data set:

ID
1
2
3
4
5

I would like the output to look like this:

ID	ID1
1
2	2
3	3
4
5	5

mkeintz · Posted 02-22-2021 11:34 AM

This is a situation where the IN= data set option is just what you need. It creates a temporary flag (IN1 here) that is 1 whenever HAVE_1 contributes to the current observation:

data have_1;
  input id @@ ;
datalines;
2 3 5
run;

data have_2;
  input id @@ ;
datalines;
1 2 3 4 5
run;

data want;
  merge have_1 (in=in1) have_2 ;
  by id;
  if in1 then id1=id;
run;

Of course, this assumes that HAVE_1 and HAVE_2 are both sorted by id.
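
If the inputs are not already in ID order, the BY statement in the merge will fail with an error that the BY variables are not properly sorted. A minimal sketch of the sorting step, assuming the data set names HAVE_1 and HAVE_2 from the example above:

```sas
/* Sort both inputs by the BY variable before the match-merge. */
proc sort data=have_1;
  by id;
run;

proc sort data=have_2;
  by id;
run;
```

With the sample data this step is a no-op, since both data sets were entered in ascending ID order.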


mklangley · Posted 02-22-2021 10:30 AM

data have_1;
    input id;
    datalines;
2
3
5
;
run;

data have_2;
    input id;
    datalines;
1
2
3
4
5
;
run;

proc sql;
    create table want as
    select a.id
          ,b.id as id1
    from have_2 a
    full join have_1 b 
        on a.id = b.id;
quit;
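
One caveat with this query: a.id comes only from HAVE_2, so an ID present in HAVE_1 but not in HAVE_2 would come out with a missing ID. That cannot happen with the sample data, but if it can in yours, a sketch using COALESCE might look like the following (the output name WANT_ALL is only a placeholder, not something from the original question):

```sas
proc sql;
    create table want_all as
    select coalesce(a.id, b.id) as id   /* take the ID from whichever side has it */
          ,b.id as id1                  /* missing when the ID is not in HAVE_1 */
    from have_2 a
    full join have_1 b
        on a.id = b.id;
quit;
```

For the sample data this should give the same rows as the query above; the two only differ when HAVE_1 contains IDs that HAVE_2 lacks.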

hjjijkkl · Posted 02-22-2021 10:53 AM

Is there another way to do it without using proc sql?

mklangley · Posted 02-22-2021 11:08 AM

There might be an easier way, but here is one way to do it using a DATA step merge:

data have_1;
    input id;
    datalines;
2
3
5
;
run;

data have_2;
    input id;
    datalines;
1
2
3
4
5
;
run;

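/* Copy id into id1 so the HAVE_1 value gets its own column after the merge. */
data have_1;
    set have_1;
    id1 = id;
run;

/* Match-merge by id; id1 stays missing for ids found only in HAVE_2. */
data want;
    merge have_1 (in=one)
          have_2 (in=two);
    by id;
    if one or two;   /* always true in a merge, so this line is optional */
run;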
