Hi everyone,
I'm trying to create a box plot that considers frequencies (similar to how you can specify FREQ or WEIGHT in PROC MEANS). My data is grouped into 16 categories, where each one should have its own box. After going through the documentation, I haven't been able to find any options in the BOXPLOT procedure itself.
I've also tried creating a summary dataset with PROC MEANS and then using it as a HISTORY dataset in the BOXPLOT procedure, but the layout of that dataset doesn't match what BOXPLOT expects.
If I could somehow rename the dataset, I'd imagine the code below would work:
PROC MEANS DATA=DISTANCE_BY_GROUP MIN P25 MEDIAN MEAN P75 MAX STD N;
    BY GROUP_NO;
    FREQ N_CLAIMS;
    OUTPUT OUT=SUMM_STATS;
RUN;

PROC BOXPLOT HISTORY=SUMM_STATS;
    PLOT DISTANCE*GROUP_NO;
RUN;
Does anyone have any suggestions on how to handle this?
Thanks in advance!
Try something like this:
proc sgplot data=DISTANCE_BY_GROUP;
vbox your_analysis_var / category=GROUP_NO freq=N_CLAIMS;
run;
Hope this helps!
Dan
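For anyone who would rather stay with PROC BOXPLOT, here is a rough sketch of the HISTORY= route described in the question. It assumes the analysis variable is DISTANCE (taken from the PLOT statement above) and relies on the naming convention that a HISTORY= dataset uses the process name plus the suffixes L, 1, X, M, 3, H, S and N for the minimum, first quartile, mean, median, third quartile, maximum, standard deviation and group size. The PROC SORT step and the SORTED dataset name are additions for illustration, and none of this has been run against the original data.

```sas
/* BY-group processing needs the data sorted by the group variable. */
/* SORTED is a hypothetical dataset name used only in this sketch.  */
proc sort data=DISTANCE_BY_GROUP out=SORTED;
    by GROUP_NO;
run;

/* Frequency-weighted summary statistics, named the way a           */
/* HISTORY= dataset expects them for the process DISTANCE.          */
proc means data=SORTED noprint;
    by GROUP_NO;
    var DISTANCE;          /* assumed analysis variable */
    freq N_CLAIMS;
    output out=SUMM_STATS
        min=DISTANCEL
        q1=DISTANCE1
        mean=DISTANCEX
        median=DISTANCEM
        q3=DISTANCE3
        max=DISTANCEH
        std=DISTANCES
        n=DISTANCEN;
run;

/* Draw one box per group from the summary statistics. */
proc boxplot history=SUMM_STATS;
    plot DISTANCE*GROUP_NO;
run;
```

Because the plot is built from summary statistics only, the whiskers would run to the group minimum and maximum and individual outliers cannot be drawn. The SGPLOT approach above is simpler if that matters.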