BookmarkSubscribeRSS Feed
abhim701
Calcite | Level 5

I have a data set with dichotomous dependent variable. Can I use Rstudent option in proc reg to identify outliers for a dichotomous dependent variable?

 

proc reg data = work.import;
model Target= x;
output out = outdata rstudent= r;
run;

5 REPLIES 5
Reeza
Super User

Couldn't you look at a histogram or distribution of the variable by the outcome variable?

 

I don't think the R student statistics is designed for categorical variables. 

Ksharp
Super User

No. You pick up wrong model for dichotomous variable.
Check PROC LOGISTIC and OBSSTAT option.

abhim701
Calcite | Level 5

Hi,

 

Could you explain the OBSSTAT option with an example. It would be of great help.

 

Thanks

Abhishek

Ksharp
Super User
Sorry. Check INFLUENCE of model statement.

model ......./ influence

especially check DIFDEV DIFSQUAR


INFLUENCE< (STDRES) >
displays diagnostic measures for identifying influential observations in the case of a binary response
model. For each observation, the INFLUENCE option displays the case number (which is the sequence
number of the observation), the values of the explanatory variables included in the final model, and
the regression diagnostic measures developed by Pregibon (1981). The STDRES option includes
standardized and likelihood residuals in the display.


Reeza
Super User

Look at the INFLUENCE option with proc LOGISTIC on the model statement. The documentation has an example on regression diagnostics including detecting influential observations. 

 

http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_logistic_syn...

http://support.sas.com/documentation/cdl/en/statug/68162/HTML/default/viewer.htm#statug_logistic_exa...

 

 

If using proc genmod then obsstat is an option. Both will help with detecting outliers. 

 

 

SAS Innovate 2025: Register Now

Registration is now open for SAS Innovate 2025 , our biggest and most exciting global event of the year! Join us in Orlando, FL, May 6-9.
Sign up by Dec. 31 to get the 2024 rate of just $495.
Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 5 replies
  • 2205 views
  • 6 likes
  • 3 in conversation