Hi
how to move rows to columns:
id attr
1 phone1
1 phone2
1 addr1
2 phone1
3 addr1
3 addr2
4 email1
id phone addr email
1 phone1,phone2 addr1
2 phone1
3 addr1,addr2
4 email1
I can only guess at the variable lengths you might really need, but I think you are looking for something like:
data have;
informat attr $80.;
input id attr &;
cards;
1 555-555-5555
1 (555) 555-5555
1 80 Here Street, Somewhere
2 1-555-555-5555
3 90 There Road, Elsewhere
3 Apt 1A
;
data want (keep=email address phone);
set have;
retain pattern address phone email;
length address phone email $80;
by id;
if _n_ = 1 then pattern = prxparse("/\d\d\d-\d{4}/");
if first.id then do;
call missing(address);
call missing(email);
call missing(phone);
end;
if PRXMATCH(pattern,attr) then phone=catx(',',phone,attr);
else if index(attr,"@") then email=catx(',',email,attr);
else address=catx(',',address,attr);
if last.id then output;
run;
Tell us more! Is your example really what your data look like or do the phone numbers look like typical phone numbers (and, if so, how many different types are there? e.g. 555-555-5555, 1-555-555-5555, (555) 555-5555), is the @ symbol used in any other fields other than email addresses?
Or are you only asking regarding the specifics you initially posted?
phone is typical phone numbers
addr is real address
and email has @
I can only guess at the variable lengths you might really need, but I think you are looking for something like:
data have;
informat attr $80.;
input id attr &;
cards;
1 555-555-5555
1 (555) 555-5555
1 80 Here Street, Somewhere
2 1-555-555-5555
3 90 There Road, Elsewhere
3 Apt 1A
;
data want (keep=email address phone);
set have;
retain pattern address phone email;
length address phone email $80;
by id;
if _n_ = 1 then pattern = prxparse("/\d\d\d-\d{4}/");
if first.id then do;
call missing(address);
call missing(email);
call missing(phone);
end;
if PRXMATCH(pattern,attr) then phone=catx(',',phone,attr);
else if index(attr,"@") then email=catx(',',email,attr);
else address=catx(',',address,attr);
if last.id then output;
run;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.