Solved: Count distinct values by subjects

Neff · Posted 06-17-2018 05:15 PM

I have this large data set comprising of over 1000 subjects and about 3000 variables which are ICD9 codes (Dx1- Dx3000).

Dx represents the ICD diagnosis codes for each visit and is represented here as 1-4

My dataset looks similar to this:

I would like to get the distinct disease count for each subject.

For example: subject 1 has 4 visits but 3 distinct diagnosis.

subject 2 has 4 visits and 4 distinct diagnosis

Thanks.

novinosrin · Posted 06-17-2018 05:29 PM

1. proc transpose

2. proc freq by id

View solution in original post

Neff · Posted 06-17-2018 05:15 PM

I have this large data set comprising of over 1000 subjects and about 3000 variables which are ICD9 codes (Dx1- Dx3000).

Dx represents the ICD diagnosis codes for each visit and is represented here as 1-4

My dataset looks similar to this:

I would like to get the distinct disease count for each subject.

For example: subject 1 has 4 visits but 3 distinct diagnosis.

subject 2 has 4 visits and 4 distinct diagnosis

Thanks.

novinosrin · Posted 06-17-2018 05:29 PM

1. proc transpose

2. proc freq by id

PGStats · Posted 06-17-2018 06:00 PM

proc transpose data=myData out=codes;
by id;
var dx1-dx3000;
run;

proc sql;
create table counts as
select id, count(distinct col1) as nbCodes
from codes
group by id;
quit;

(untested)

PG

Neff · Posted 06-17-2018 07:50 PM

Thank you so much!! Just what I needed and it worked perfectly well

Count distinct values by subjects

Re: Count distinct values by subjects

Count distinct values by subjects

Re: Count distinct values by subjects

Re: Count distinct values by subjects

Re: Count distinct values by subjects

Count distinct values by subjects

Re: Count distinct values by subjects

Count distinct values by subjects

Re: Count distinct values by subjects

Re: Count distinct values by subjects

Re: Count distinct values by subjects

SAS Innovate 2025: Save the Date