I used studentized residuals and Cook's D. I probably could have used a where statement in a proc print if I had more experience. This is my first course using this software and don't have a lot of resources for it. This is the code I used: TITLE "Identifying Outliers & Influential Points"; PROC REG data=PGATour; model ln_prize = GIR BirdieConversion PuttsPerRound/influence r; plot student.*(GIR BirdieConversion PuttsPerRound predicted.); plot npp.*student.; RUN; Influence and r print a table that provides data, but don't show the raw data for the variables. I used the observations from that table to match with the corresponding raw data observations to determine if the datapoints should be removed or not. I was copying and pasting the observations from the raw table into a spreadsheet so I could review the data, but thought there should be a much easier way to do it.
... View more