BookmarkSubscribeRSS Feed
Bal23
Lapis Lazuli | Level 10

I need to read the dataset contains both icd-9 and icd-10. My understanding is very limited.

So I am thinking to develop a program, that use the ANYALPHA and ANYDIGIT functions. But I am not clear how to do it.

 

it would be, if the first character of the icd is numeric, then it is icd-9;

if it is "V" or "E", it would be icd-9,

else it would be icd-10;

anyone with this expereince?

tips on how to use anyalpha, anydigit funtions?

Thanks.

4 REPLIES 4
Astounding
PROC Star

These statements could go inside a DATA step:

 

length type $ 6;

if icd > ' ' then do;

   if upcase(left(icd)) in : ('V', 'E', '0', '1', '2', '3', '4', '5', '6', '7', '8', '9') then type='icd-9';

   else type='icd-10';

end;

Bal23
Lapis Lazuli | Level 10

thank you

when i think it over, i feel my knowledge is not enough

data is unlimited, so there should be some other options, so the value might be icd-9, or icd-10, or both, or other 3.

by using your code, i find there are around 60% of obs using icd10, 40% using icd9.

Now I think I would ask for advice, if there are four options, listed above, other than two options only.

I do not know how it looks like if the code is both, and how to develop a program to read that. any advice?

Thanks.

 

 

 

 

if the value of icd is more than 5 characters but less than 8, I can call it is icd10,

if the value has less than 3 character or more than 8 characters, i will call it "other"

...

Astounding
PROC Star

When it comes to questions about what is in your data, and how you should interpret it, I'm not sure I can help.  For example, I'm not sure when "both" would be appropriate.  But the issues you are talking about are cases that SAS can handle.

 

Notice that  your code contains a logical error.  ELSE applies to only the previous IF/THEN statement, not to both as a group.  It would be improved by changing the second IF/THEN statement to say:

 

else if upcase(left(icd)) in : ("A", "B", "C" ...) then type='icd=10';

 

It would help to know whether ICD is already left-hand justified or not.  If it is, you can eliminate the LEFT function wherever any of the sample code uses it.

 

To measure and use the length of ICD, these statements might be appropriate:

 

len = length(left(icd));

if len > 8 then type='other';

else if len < 3 then type='other';

else if len in (6, 7) then type='icd-10';

 

There are many ways to set up these statements.  None of them are complex, but it takes some attention to the details to decide on the proper set of rules.

Reeza
Super User

I'm going to suggest a different but approach, build a library of ALL ICD9 and ICD10 codes, preferably each as a format. Then check each value against that list and see if it's present in either, both, or neither and classify accordingly.  Lists of each set of codes are available online.

 

 

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 4 replies
  • 1465 views
  • 2 likes
  • 3 in conversation