Dear All,
I have a large dataset with diagnosis codes that are up to four digits. I need to make them more uniform by adding zeros preceding the first digit (letter) to make them 4 digits but in a character format.
I've tried this but I end up with all zeros.
data diagnosis;
input diagnosis;
datalines;
094
16
E4
;
run;
data diagnosis;
set diagnosis;
diagnose=translate(right(diagnosis),'0',' ');
;
run;
My desired output would be this:
0094 |
0016 |
00E4 |
First of all, your first data step won't really work, as you try to create a numeric variable, so add a dollar sign:
data diagnosis;
input diagnosis $;
datalines;
094
16
E4
;
run;
Then try this:
data diagnosis;
set diagnosis;
length newdiag $4;
newdiag = '0000';
substr(newdiag,5-length(diagnosis)) = diagnosis;
drop diagnosis;
rename newdiag=diagnosis;
run;
First of all, your first data step won't really work, as you try to create a numeric variable, so add a dollar sign:
data diagnosis;
input diagnosis $;
datalines;
094
16
E4
;
run;
Then try this:
data diagnosis;
set diagnosis;
length newdiag $4;
newdiag = '0000';
substr(newdiag,5-length(diagnosis)) = diagnosis;
drop diagnosis;
rename newdiag=diagnosis;
run;
Perfect. This works well.
data want; var="E4"; want=cats(repeat('0',3-lengthn(var)),var); run;
data diagnosis;
input diagnosis $;
datalines;
094
16
E4
;
run;
data diagnosis;
set diagnosis;
length newdiag $4;
newdiag = diagnosis;
newdiag= translate(right(newdiag),'0',' ');
run;
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.