BookmarkSubscribeRSS Feed
dustychair
Pyrite | Level 9

Hi All,

I have a data set as below: I want to compare the histograms of the scores of two different groups with all possible combinations. For example,

comparison of  scores for groups ts=A and tc=B;

comparison of  scores for groups ts=B and tc=B;

comparison of  scores for  groups ts=B and tc=A;

comparison of  scores for groups ts=A and tc=A;

 

So, there will be 4 different overlaying histograms. Could you help to draw overlaying histograms of these groups' scores?

 

Thank you,

 

 

score ts tc matching
180 A B 0
200 B B 1
360 A B 0
420 A A 1
500 B A 0
234 B B 1
461 B A 0

2 REPLIES 2
Jagadishkatam
Amethyst | Level 16

Please try below code

 

data have;
input score ts$ tc$ matching;
cards;
180 A B 0
200 B B 1
360 A B 0
420 A A 1
500 B A 0
234 B B 1
461 B A 0
;

data want;
set have;
if ts='A' and tc='B' then group='1';
if ts='B' and tc='B' then group='2';
if ts='B' and tc='A' then group='3';
if ts='A' and tc='A' then group='4'; 
run;

proc sgplot data=want;  
vbar group/response=score; 
run;

image.png

Thanks,
Jag
Rick_SAS
SAS Super FREQ

I don't think the problem is formulated correctly. To get two overlayed histograms, you need some observations that belong to one group and other (disjoint) observations in another group. But in your example,  ts='A' includes observations for both tc='A' and tc='B'. So you can't overlay the histograms. There are not four histograms to overlay.

 

Two things that you can do:

1. create two overlapping histograms. The first compares tc='A' and tc='B' given that ts='A'. The second compares tc='A' and tc='B' given that ts='B'. 

2. create a panel of histograms of the four groups, which are the joint levels of (ts, tc). 

 

Both of these options are shown below on some simulated data:

data Have;
do ts = 'A', 'B';
   do tc = 'A', 'B';
      do k=1 to 50;
         score = 3*(ts='A') + -1*(tc='A') + rand("Normal");
         output;
      end;
   end;
end;
run;

/* 1. Given a value for ts, overlay the distributions for levels of tc */
proc sgplot data=Have;
by ts;
histogram score / group=tc transparency=0.5;
run;

/* 2. Panel the four distributions */
data Want;
set Have;
group = cats("ts=", ts, "; tc=", tc); 
run;

proc sgpanel data=Want;
panelby group / onepanel columns=1 novarname;
histogram score;
run;

SAS INNOVATE 2024

Innovate_SAS_Blue.png

Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.

If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website. 

Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

Get the $99 certification deal.jpg

 

 

Back in the Classroom!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 2 replies
  • 330 views
  • 0 likes
  • 3 in conversation