Hi
how to move rows to columns:
id attr
1 phone1
1 phone2
1 addr1
2 phone1
3 addr1
3 addr2
4 email1
id phone addr email
1 phone1,phone2 addr1
2 phone1
3 addr1,addr2
4 email1
I can only guess at the variable lengths you might really need, but I think you are looking for something like:
data have;
informat attr $80.;
input id attr &;
cards;
1 555-555-5555
1 (555) 555-5555
1 80 Here Street, Somewhere
2 1-555-555-5555
3 90 There Road, Elsewhere
3 Apt 1A
;
data want (keep=email address phone);
set have;
retain pattern address phone email;
length address phone email $80;
by id;
if _n_ = 1 then pattern = prxparse("/\d\d\d-\d{4}/");
if first.id then do;
call missing(address);
call missing(email);
call missing(phone);
end;
if PRXMATCH(pattern,attr) then phone=catx(',',phone,attr);
else if index(attr,"@") then email=catx(',',email,attr);
else address=catx(',',address,attr);
if last.id then output;
run;
Tell us more! Is your example really what your data look like or do the phone numbers look like typical phone numbers (and, if so, how many different types are there? e.g. 555-555-5555, 1-555-555-5555, (555) 555-5555), is the @ symbol used in any other fields other than email addresses?
Or are you only asking regarding the specifics you initially posted?
phone is typical phone numbers
addr is real address
and email has @
I can only guess at the variable lengths you might really need, but I think you are looking for something like:
data have;
informat attr $80.;
input id attr &;
cards;
1 555-555-5555
1 (555) 555-5555
1 80 Here Street, Somewhere
2 1-555-555-5555
3 90 There Road, Elsewhere
3 Apt 1A
;
data want (keep=email address phone);
set have;
retain pattern address phone email;
length address phone email $80;
by id;
if _n_ = 1 then pattern = prxparse("/\d\d\d-\d{4}/");
if first.id then do;
call missing(address);
call missing(email);
call missing(phone);
end;
if PRXMATCH(pattern,attr) then phone=catx(',',phone,attr);
else if index(attr,"@") then email=catx(',',email,attr);
else address=catx(',',address,attr);
if last.id then output;
run;
Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.