BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
brophymj
Quartz | Level 8

I'm creating a graph with exposure across different levels of a factor, vehicle age. On a secondary axis I include the average frequency per level. My code is

proc sgplot data=dataset1 (where=(compress(level) not in ('-1','Z.Unknown')));

     title "Vehicle Age";

     vbar level / group = source response = exposure;

  run;

This produces the graph with exposure split by source for each level but I want to include the average frequency on the secondary axis but this is available on another dataset that doesn't split out exposure into source. So in dataset 1, there is an average frequency but this is given for each source in each vehicle age. I want to use the overall average frequency for each level. something like...

proc sgplot data=Egidwh_oneway_yr3 (where=(rank <= 10 and compress(factor) = "glm_4_vehicl_vehicle_age" and compress(level) not in ('-1','Z.Unknown')));
     title "&factor - Exposure By Year";
     vbar level / group = source response = exposure;
     vline level / response=freq y2axis;

  run;

but where the vline info comes from the more summarized dataset

any ideas??

1 ACCEPTED SOLUTION

Accepted Solutions
ballardw
Super User

The place to start would be to combine the data sets:

Data plot;

     set dataset1 dataset2;

run;

As long as your summary dataset does not have values for Exposure and the first no values for Freq then you should be good to go.

HINT: Keep only  the variables you need for the plot if you have lots. And since you have such complex where criteria, do that in the combination step, it looks like on the first dataset.

View solution in original post

6 REPLIES 6
ballardw
Super User

The place to start would be to combine the data sets:

Data plot;

     set dataset1 dataset2;

run;

As long as your summary dataset does not have values for Exposure and the first no values for Freq then you should be good to go.

HINT: Keep only  the variables you need for the plot if you have lots. And since you have such complex where criteria, do that in the combination step, it looks like on the first dataset.

brophymj
Quartz | Level 8

Brilliant, that makes perfect sense. Thanks

brophymj
Quartz | Level 8

I tried this but I get the following error message:

Once a GROUP variable is used in a categorical chart, that GROUP

         variable must be used in all overlaid charts.  The specified GROUP

         variable has been removed from the graph display.

here is my code:

data vehage(keep = factor level all_freq all_sev flag4 );
set Egidwh_oneway4;
if compress(factor) = "glm_4_vehicl_vehicle_age";
run;

data vehageyr(drop = all_freq all_sev ) ;
set Egidwh_oneway_yr3;
if compress(factor) = "glm_4_vehicl_vehicle_age";
run;

data comb;
set vehage vehageyr;
run;

  proc sgplot data=comb (where=(flag4 = "Y" and compress(factor) = "glm_4_vehicl_vehicle_age" and compress(level) not in ('-1','Z.Unknown')));
     title "vehicle_age - Severity";
     vbar level / group = source response=exposure;
     vline level /  response=all_freq y2axis;
  run;

ballardw
Super User

Sorry, I missed the group part before. I way is to add a level for group to the summary data (probably only one) and add to the vline syntax.

data comb;

set vehage vehageyr (in=in1);

/* assuming that vehageyr is the summary data lacking the Source Variable*/

if in1 then Source = <a source value>;

run;

to prevent the second plot from adding to the legend use the NAME="text" to name each of the plots by using KEYLEGEND "text" ; where text is the same text used for the VBAR name option.

brophymj
Quartz | Level 8

Could you give a little example of this, I don't quite follows.

Thanks

ballardw
Super User

proc sgplot data=comb (where=(flag4 = "Y" and compress(factor) = "glm_4_vehicl_vehicle_age" and compress(level) not in ('-1','Z.Unknown')));

     title "vehicle_age - Severity";

     vbar level / group = source response=exposure name='bar';

     vline level /  group=source response=all_freq y2axis name='line';

      keylegend 'bar';

  run;

sas-innovate-wordmark-2025-midnight.png

Register Today!

Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.


Register now!

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 6 replies
  • 11352 views
  • 0 likes
  • 2 in conversation