Hi everyone,
I'm trying to create a box plot that considers frequencies (similar to how you can specify FREQ or WEIGHT in PROC MEANS). My data is grouped into 16 categories, where each one should have its own box. After going through the documentation, I haven't been able to find any options in the BOXPLOT procedure itself.
I've also tried to create a summary dataset using MEANS and then using it as a HISTORY dataset in the BOXPLOT procedure, but the format of the dataset doesn't match what is needed for a BOXPLOT.
If I could somehow rename the dataset, I'd imagine the code below would work:
PROC MEANS DATA=DISTANCE_BY_GROUP MIN P25 MEDIAN MEAN P75 MAX STD N;
BY GROUP_NO;
FREQ N_CLAIMS;
OUTPUT OUT=SUMM_STATS;
RUN;
PROC BOXPLOT HISTORY=SUMM_STATS;
PLOT DISTANCE*GROUP_NO;
RUN;
Does anyone have any suggestions on how to handle this?
Thanks in advance!
Try something like this:
proc sgplot data=DISTANCE_BY_GROUP;
vbox your_analysis_var / category=GROUP_NO freq=N_CLAIMS;
run;
Hope this helps!
Dan
Try something like this:
proc sgplot data=DISTANCE_BY_GROUP;
vbox your_analysis_var / category=GROUP_NO freq=N_CLAIMS;
run;
Hope this helps!
Dan
Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.
If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.