Hi all, I have a dataset with approximately 5.5 million observations of 3 numeric variables. All I need are the means and a few quantiles for all 3 variables stored in a table. I am currently running the following code and getting the desired result: proc univariate data=MYDATA outtable=UNIV (keep=_var_ _min_ _p5_ _q1_ _median_ _mean_ _q3_ _p95_ _max_) noprint;
run; This, however, seems like a waste of resources as it computes a number of statistics which I then drop immediately, and it also generates a warning about the number of observations being too large to calculate Qn, which I do not need here. The number of nonmissing observations for variable X is too large to compute the robust measure of scale Qn. The statistic Qn is set to missing. In sum, the code does what I want, but makes a number of unnecessary computations. In order to save time and resources, and also just out of interest, I was wondering whether there was any way to restrict the computations to a list of explicitly requested statistics. Thanks in advance for your expertise.
... View more