I was using proc sgplot to generate a box plot, and it aborted due to a memory issue. The data set has about 1.7 million records, but there are only 16 distinct values for the x-axis. Other procedures in SAS can also produce box plots, so I was wondering which one handles big data sets better. I would also like to add a limit so that the box plot is skipped when the number of rows exceeds it, but I'm not sure how to calculate that limit based on the memory I have.
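The row-count guard described above could be sketched as a macro along these lines. This is a minimal sketch, not a tested solution: the macro name `guarded_vbox` and the `limit=` default are hypothetical, and the threshold is an arbitrary placeholder rather than a value derived from available memory.

```sas
/* Hypothetical guard: skip the plot when the data set is too large. */
/* The default limit is an arbitrary placeholder, not a memory-based value. */
%macro guarded_vbox(data=, limit=2000000);
  %local dsid nobs rc;

  /* Open the data set to read its metadata */
  %let dsid = %sysfunc(open(&data));
  %if &dsid = 0 %then %do;
    %put ERROR: Could not open &data..;
    %return;
  %end;

  /* NLOBS = number of logical (non-deleted) observations; */
  /* it can be -1 when the engine cannot report a count (e.g. some views) */
  %let nobs = %sysfunc(attrn(&dsid, nlobs));
  %let rc   = %sysfunc(close(&dsid));

  %if &nobs > &limit %then %do;
    %put NOTE: &data has &nobs rows, above the limit of &limit.. Box plot skipped.;
    %return;
  %end;

  proc sgplot data=&data;
    vbox T2 / category=fis_week;
  run;
%mend guarded_vbox;

%guarded_vbox(data=t2ds, limit=2000000);
```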
You should show the code you used, in case something you specified is causing the problem. Requesting plots for multiple variables, for example, might cause an issue.
I just generated a random data set with 1.7 million records, one analysis variable, and 16 x values. Proc boxplot completed the graph in under 1 second, so the size of the data set is unlikely to be the main issue.
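A test like the one described can be reproduced roughly as follows. This is a sketch under assumptions: the data set name `work.test`, the variable names `x` and `y`, and the seed are made up for illustration, and the actual test code was not posted.

```sas
/* Hypothetical test data: 1.7 million rows, 16 distinct x values */
data work.test;
  call streaminit(12345);            /* arbitrary seed for repeatability */
  do i = 1 to 1700000;
    x = ceil(16 * rand('uniform'));  /* integer category from 1 to 16 */
    y = rand('normal');              /* analysis variable */
    output;
  end;
  drop i;
run;

/* PROC BOXPLOT expects the input sorted by the group variable */
proc sort data=work.test;
  by x;
run;

proc boxplot data=work.test;
  plot y*x;
run;
```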
The code is super simple:
proc sgplot data = t2ds;
    vbox T2 / category = fis_week;
    refline LCL;
    refline UCL;
    refline CTL;
    format fis_week $4.;
run;
Thanks, ballardw, for pointing me in the right direction. I think the format statement is what was causing the problem. After I removed it, the plot completed in seconds.
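For reference, the step that completed would presumably be the original code with only the format statement removed, something like:

```sas
/* Same step as posted, minus the FORMAT statement on fis_week */
proc sgplot data = t2ds;
    vbox T2 / category = fis_week;
    refline LCL;
    refline UCL;
    refline CTL;
run;
```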