Solved: Re: Count number of distinct cases meeting different criteria

lizzy28 · Posted 10-31-2014 12:54 PM

I would like to count the number of distinct cases that meet different criteria in my large dataset. The example data looks like this:

obsid	var
1	a
1	b
1	b
2	b
2	b
2	c
2	c
3	a
3	a
4	c

I need to see how many distinct obsids contain "a", "b" or "c" in var.

I could run a separate query for each criterion, one after another, like this:

proc sql; select count(distinct obsid) as N_a from project.dswi_nodebride where var="a"; quit;

proc sql; select count(distinct obsid) as N_b from project.dswi_nodebride where var="b"; quit;

proc sql; select count(distinct obsid) as N_c from project.dswi_nodebride where var="c"; quit;

I have many criteria, so is there a simpler way to handle all of these similar criteria at once?

Thanks a lot!

PGStats · Posted 10-31-2014 01:05 PM

SQL is ideal for this:

data have;
input obsid var $;
datalines;
1 a
1 b
2 b
2 c
3 a
4 c
;

proc sql;
create table want as
select var, count(distinct obsid) as n
from have
group by var;
select * from want;
quit;

PG
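For illustration, here is a minimal sketch of the same query run against the full example from the question, duplicate rows included, to show that count(distinct obsid) counts each obsid only once per value of var. The dataset name have_full is hypothetical and not from the thread. With this data, each of a, b and c should come out with n = 2.

```sas
/* Sketch only: have_full is a hypothetical name for the question's sample data */
data have_full;
input obsid var $;
datalines;
1 a
1 b
1 b
2 b
2 b
2 c
2 c
3 a
3 a
4 c
;

proc sql;
/* Repeated (obsid, var) pairs are collapsed by DISTINCT within each group */
select var, count(distinct obsid) as n
from have_full
group by var;
quit;
```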


stat_sas · Posted 10-31-2014 01:04 PM

proc sql;
select var, count(distinct obsid) as N
from have
group by var;
quit;
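If the counts are wanted side by side in one row, like the N_a, N_b and N_c columns in the original question, one possible sketch puts a CASE expression inside count(distinct ...). This is not from the thread itself. It assumes the have dataset created in the accepted answer, a numeric obsid, and a list of values short enough to type out by hand.

```sas
proc sql;
/* CASE yields obsid only for matching rows; count() ignores the missing values */
select count(distinct case when var="a" then obsid else . end) as N_a,
       count(distinct case when var="b" then obsid else . end) as N_b,
       count(distinct case when var="c" then obsid else . end) as N_c
from have;
quit;
```

With many criteria, the GROUP BY form is likely easier to maintain, since it picks up new values of var without any change to the query.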
