BookmarkSubscribeRSS Feed
stanmarciano
Fluorite | Level 6

This is not a technical/code related question (more of a general question related to proc GLM). I had some codes written for a project a couple of years back (2 factor anova model). I used  Proc GLM. A simple version of the code is below:

 

proc glm data = mydata;
class Independent_Var1 Independent_Var2;
model Dependent_Var=Independent_Var1|Independent_Var2/ss3 solution;
lsmeans Independent_Var1|Independent_Var2;
run;

I recently dug out this long forgotten project and started working on it again. I tried to replicate my previous findings (dataset is exactly the same). However, I am not getting my previous results back. There are slight changes in the F statistic and p-values. For example for Independent_Var1, the original p-value was 0.08 and I now get 0.12. Again, with the same dataset, I would expect to get my previous results.

 

Can anyone help me understand why I am getting somewhat different results. Have there been any changes or updates to Proc GLM over the last couple of years or so? Is there a resource to track changes to the proc's? I do keep my data and codes organized so I am confident that I am using my original data as well as the code. 

 

Thanks in advance.

 

EDIT: During this time, I have gone from using SAS 9.2 to SAS 9.4. 

3 REPLIES 3
PaigeMiller
Diamond | Level 26

There are three possible reasons why the results might have changed

 

  1. The data is different
  2. The code is different
  3. PROC GLM has changed

 

You can do the research to confirm that #1 and #2 have not happened. Simply saying they haven't changed (and using bold letters) doesn't convince me. If you conclude that #3 is the only possibility, we would need to see the results before and the results after.

--
Paige Miller
Rick_SAS
SAS Super FREQ

If you post your actual code and data, we can probably help more. Since the F statistic for the ANOVA model was developed 100 years ago, I don't think it is likely that the computation changed between SAS 9.2 and 9.4. 

SteveDenham
Jade | Level 19

If the method was iterative (say PROC MIXED), I would also look at hardware and OS differences.  But to find this for PROC GLM seems odd.  Are you sure there is no data preprocessing in the original project that has been dropped out in the current project?

 

SteveDenham

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 580 views
  • 0 likes
  • 4 in conversation