Dear Community,
I would like to learn how to calculate the mean from string variables some of them contain comments and range.
For example, the data I have is as
ID | hours |
1 | 7-8 hours after play |
2 | 0-1 |
3 | 10-12 |
4 | 11 |
And the data wanted is
ID | hours | hours_m |
1 | 7-8 hours after play | 7.5 |
2 | 0-1 | 0.5 |
3 | 10-12 | 11 |
4 | 11 | 11 |
Thank you so much.
Use the COMPRESS function to get rid of anything but digits and hyphens, then count the "words", then calculate or just input:
data have;
infile datalines dlm="09"x dsd truncover;
input ID $ hours :$30.;
datalines;
1 7-8 hours after play
2 0-1
3 10-12
4 11
;
data want;
set have;
string = compress(hours,"-","kd");
if countw(string,"-") = 2
then hours_m = (input(scan(string,2,"-"),best.) + input(scan(string,1,"-"),best.)) / 2;
else hours_m = input(string,best.);
drop string;
run;
Use the COMPRESS function to get rid of anything but digits and hyphens, then count the "words", then calculate or just input:
data have;
infile datalines dlm="09"x dsd truncover;
input ID $ hours :$30.;
datalines;
1 7-8 hours after play
2 0-1
3 10-12
4 11
;
data want;
set have;
string = compress(hours,"-","kd");
if countw(string,"-") = 2
then hours_m = (input(scan(string,2,"-"),best.) + input(scan(string,1,"-"),best.)) / 2;
else hours_m = input(string,best.);
drop string;
run;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.