The code below counts the distinct values of the variable y for each value of x and adds a summary row. However, it requires two queries and a data step. Is there a way to do it all in one query?
/*sample data*/
data a;
input x $1. y 1.;
datalines;
a1
a2
a2
b1
b3
;
run;
proc sql;
/*first query*/
create table b as select x, n(distinct y) as y_cnt
from a
group by x;
/*second query*/
create table c as select 'Total' as x, n(distinct y) as y_cnt
from a;
quit;
data d;
length x $5;
/*append queries*/
set b c;
run;
Do the y values overlap across the x groups?
That is, if you sum the per-group counts, do you expect them to add up to the total, or do you need a separate distinct count because of the overlap?
If you need that separate distinct count, I can't think of an alternative 😞
Hopefully someone else can; I'm replying partly to see whether there is a solution to this problem.
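One possibility, offered as a sketch and not as a definitive answer: PROC SQL lets two SELECT clauses be combined with a set operator, so the per-group counts and the overall count can be written into one table with a single CREATE TABLE statement. The example assumes the sample table a from the question; the output table name d and the explicit length=5 on x (so that 'Total' is not truncated to the one-character length of x) are choices made here for illustration.

```sas
proc sql;
   /* one statement: per-group distinct counts plus an overall row */
   create table d as
   select x length=5, n(distinct y) as y_cnt
      from a
      group by x
   union all
   select 'Total' as x length=5, n(distinct y) as y_cnt
      from a;
quit;
```

With the sample data, groups a and b each have 2 distinct values of y, while the overall distinct count is 3, because y=1 appears in both groups. That is why the Total row needs its own n(distinct y) and cannot be obtained by summing the group counts. Note that UNION ALL does not guarantee row order, so if the Total row must come last, an explicit ordering column may be needed.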