BookmarkSubscribeRSS Feed
Piers_C
Calcite | Level 5

Thanks for the advice, I will give this a go. A colleague of mine also suggested subsampling the large distribution of residuals, and testing the smaller subsamples for normality. Perhaps if subsamples also failed the KS and/or other tests this might be a stronger indication of an undiagnosed problem?

 

 

3 REPLIES 3
Rick_SAS
SAS Super FREQ

How are you conducting this analysis? GLM? GLIMMIX? Suppose you determine that the errors are slightly heavy-tailed? How will that change the way you conduct the analysis? 

 

Are you just "verifying assumptions" or is there a real problem that you are trying to resolve?

 

Piers_C
Calcite | Level 5

I am using proc mixed.

 

My model code is:

 

title 'TOTAL Hi frequency HI v LO FULL MLM';

proc mixed data=mlm_hi covtest;

 class sub sess fbin tbin;

 model t_diff=tbin|fbin sess / solution outpredm = pred_hi;

 repeated / subject=sub(sess) type=sp(gau) (tbin fbin);

 lsmeans tbin*fbin;

run;

 

Rather than trying to solve a known problem, I am really trying to check that I have not violated any assumptions, hence chencking the residuals. And also your suggestion of plotting the predicted versus the residual outputs, which defninitely do not have a fan structure. I will also try the QQplots.

Rick_SAS
SAS Super FREQ

Regarding your friend's suggestion to subsample, I think that would not be helpful. If the subsamples are size 100,000, all subsamples will reject normality. If the subsamples are size 5, every subsample will accept normality.  For some value in between (500?) you might get 50% of samples reject and 50% accept.  

 

Look at the normal Q-Q plot, which will graphially indicate whether the data are approximately normal:

 

proc univariate normal;

QQPLOT x / normal;

run;

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 3 replies
  • 1289 views
  • 2 likes
  • 2 in conversation