BookmarkSubscribeRSS Feed
hayesk
Calcite | Level 5

Hello! I have data for a sample of 74 people who answered a 97-question assessment utilizing a likert scale. Looking at the data visually, we noticed a few responses that look like outliers/careless responses and we are considering removing them from the dataset. We want to do a formal outlier analysis to justify this. I read that Mahalanobis distance can be used to identify outliers but I don't know how to do it in SAS and with my variables. The 97-item assessment is broken up into subscales with items belonging to only one subscale. Would I calculate the means for each of the subscales and then use those values to calculate Mahalanobis' distance between the individual's mean on the subscales and the sample's means? What would the specific syntax be for this?

2 REPLIES 2

hackathon24-white-horiz.png

The 2025 SAS Hackathon Kicks Off on June 11!

Watch the live Hackathon Kickoff to get all the essential information about the SAS Hackathon—including how to join, how to participate, and expert tips for success.

YouTube LinkedIn

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1317 views
  • 0 likes
  • 3 in conversation