Statistical Procedures

Programming the statistical procedures from SAS
BookmarkSubscribeRSS Feed
ShufeGuoding
Obsidian | Level 7

How the statistical procedures in SAS, such as proc means, proc model , to quote only a few, to read data from the SAS datasets? , read the all observations once a time to calculate the statistics, such as sample mean,  or read one observation each time into PDV as DATA STEP and calculate the statistics each time? I can 't find the answer in SAS document and ask help.     

2 REPLIES 2
Astounding
PROC Star

It's not in the documentation.  It's proprietary information.  Most of it is a black box that we as users neither observe nor control.  To a very small extent, some procedures (such as SQL) contain options that influence how observations are read.  But that's all that is available.  

 

When you mention the PDV, you are talking about a DATA step.  There is no evidence that I know of that talks about how PROC steps handle this.  

 

Because of the behavior of PROC MEANS (and other summary procedures), we can infer that SAS computes some statistics at a detailed level.  For example, it computes SUM, N, and sum of squared values at a detailed level.  Then as needed, it computes final statistics.  (Use SUM and N to get the MEAN, use SUM, N, and sum of squared values to get the standard deviation.  Aggregate the building blocks to a higher level in order to compute statistics at a higher level.)

ShufeGuoding
Obsidian | Level 7

Some statistical analysis procedures, such as Proc Model, Proc SSM, PROC MCMC have their programming  statements, most of which are the same as those in DATA step. There is even a PDV option in OUTPUT statement  of PROC SSM. Put things together , I infer that SAS PROCs  read and deal with the observations in SAS data sets in the similar way to that in DATA STEP. 

sas-innovate-wordmark-2025-midnight.png

Register Today!

Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.


Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 711 views
  • 0 likes
  • 2 in conversation