Hi,
I have a dataset with categories in different order. I want to create a consistency in the order as [(i)cyt, (ii) end therapy, (iii) targ]. Also delete the (i),(ii), and (iii).
Different versions are [(ii)cyt, (iii) targ, (ii) end therapy], [(iii)end therapy, (i) cyt, (iii) targ], etc. Please see the data below. Thanks
Data Have:
ID
Categories
1
(i)cyt, (ii) end therapy, (iii) targ
2
(ii)cyt, (iii) targ, (ii) end therapy
3
(iii)end therapy, (i) cyt, (iii) targ
4
iii) targ, (i) cyt, (ii) end therapy
5
(iii)end therapy, (i) cyt, (iii) targ
6
iii) targ, (i) cyt, (ii) end therapy
7
(i)cyt, (ii) end therapy, (iii) targ
8
(ii)cyt, (iii) targ, (ii) end therapy
9
(ii)cyt
10
(iii) targ
11
(ii) end therapy
Data Want:
ID
Categories
Clean_Category
1
(i)cyt, (ii) end therapy, (iii) targ
Cyt, End Therapy, Targ
2
(ii)cyt, (iii) targ, (ii) end therapy
Cyt, End Therapy, Targ
3
(iii)end therapy, (i) cyt, (iii) targ
Cyt, End Therapy, Targ
4
iii) targ, (i) cyt, (ii) end therapy
Cyt, End Therapy, Targ
5
(iii)end therapy, (i) cyt, (iii) targ
Cyt, End Therapy, Targ
6
iii) targ, (i) cyt, (ii) end therapy
Cyt, End Therapy, Targ
7
(i)cyt, (ii) end therapy, (iii) targ
Cyt, End Therapy, Targ
8
(ii)cyt, (iii) targ, (ii) end therapy
Cyt, End Therapy, Targ
9
(ii)cyt
Cyt
10
(iii) targ
Targ
11
(ii) end therapy
End Therapy
data have;
infile datalines dlm=':';
input ID Categories :$100.;
datalines;
1:(i)cyt,(ii)end therapy,(iii)targ
2:(ii)cyt,(iii)targ,(ii)end therapy
3:(iii)end therapy,(i)cyt,(iii)targ
4:iii)targ,(i)cyt,(ii)end therapy
5:(iii)end therapy,(i)cyt,(iii)targ
6:iii)targ,(i)cyt,(ii)end therapy
7:(i)cyt,(ii)end therapy,(iii)targ
8:(ii)cyt,(iii)targ,(ii)end therapy
9:(ii)cyt
10:(iii)targ
11:(ii)end therapy
;
Run;
... View more