I am working with a transaction type data set where there are multiple observations for individual children. Each child has a unique "child_id" and there are multiple observations, for which the child has an associated value for the variable TREATMENT. There may or may not be repeats for the values of TREATMENT.
There are around 80,000 children. There an indefinite number of TREATMENT paths based on around 30 TREATMENT types.
The following is similar to what the data currently looks like:
Child Treatment
1 A
1 C
1 C
1 D
1 B
2 B
2 B
2 A
2 B
The following is just a brief example of what I would like the data to look like. The TREATMENT should only increment when the treatment type has changed.
Child Treatment_1 Treatment_2 Treatment_3 Treatment_4 Treatment_n
1 A C D B
2 B A B
Thank you for your insight
data need;
set have;
by child treatment notsorted;
if last.treatment then output;
run;
proc transpose data=need out=want prefix=treatment_;
by child;
var treatment;
run;
Another Solution with Data Step.
You need to have the data set, HAVE. to be pre-sorted by Child.
The array size is taken as 4 but can be extended as desired.
data want;
array t
do until(last.Child);
set have;
by Child;
if first.Child then i = 1; t = Treatment; i + 1; end;
else if Treatment ^= t[i - 1] then do; t = Treatment; i + 1; end;
end;
drop i Treatment;
run;
Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.