BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
KavitaShah
Calcite | Level 5

Hello,

 

We are trying to analyze an open source survey data to find a correlation between categorical dependent and independent variables. We have performed chi square tests to test the null hypotheses and also would like to perform logistic regression to find a correlation between these variables. However, I understand that logistic regression is predictor model, more specifically, binary classifier model which means that it tries to assign exactly one of the 2 classes to every observation and it produces probability with which each observation belongs to each class. It can be more accurately applied to variables which have exactly 2 answers like yes or no; survived or dead; male or female and such observations.

 

So now, according to our data, this does not hold true as we have many possible answers to the dependent variable like fair, poor, good, very good and excellent. However, we are considering only 2 answers namely poor and fair for our analysis.

 

Can we use logistic regression for correlation of such a data-set? We have also used Spearman Rank Correlation Coefficient to determine the correlation

 

Also, is there any way to correlate the 'c value' from the logistic regression and the Spearman Rank Correlation Coefficient?

 

Any guidance on this is highly appreciated.

 

Thank you in advance.

 

-KS.

1 ACCEPTED SOLUTION

Accepted Solutions
StatDave
SAS Super FREQ

In addition to the binomial (2-level) response, logistic regression models can be applied to multinomial (ordinal or nominal) responses that have several levels. SAS offers several procedures that can fit all of these models. See this note that details these and other types of logistic models and the procedures that can be used. For example, the LOGISTIC and SURVEYLOGISTIC procedures can be used to fit binomial and multinomial response models. If you have survey data, then you will probably want to look at PROC SURVEYLOGISTIC. These modeling procedures do not deal with "correlation" in the simple two variable sense. But the tests of the predictors in the model provide you with tests of the association of each predictor with the response after adjusting for the effects of the other predictors.

View solution in original post

6 REPLIES 6
Reeza
Super User

There's multinomial logistic regression as well or ordinal logistic regression which are more suited to your question.

 

If you look at the documentation for PROC LOGISTIC you'll find the example section has some examples. Your best bet is to walk through those examples. 

 

You may also want to consider PLS regression methods.

 

 

KavitaShah
Calcite | Level 5
Thank you for the suggestion.
StatDave
SAS Super FREQ

In addition to the binomial (2-level) response, logistic regression models can be applied to multinomial (ordinal or nominal) responses that have several levels. SAS offers several procedures that can fit all of these models. See this note that details these and other types of logistic models and the procedures that can be used. For example, the LOGISTIC and SURVEYLOGISTIC procedures can be used to fit binomial and multinomial response models. If you have survey data, then you will probably want to look at PROC SURVEYLOGISTIC. These modeling procedures do not deal with "correlation" in the simple two variable sense. But the tests of the predictors in the model provide you with tests of the association of each predictor with the response after adjusting for the effects of the other predictors.

KavitaShah
Calcite | Level 5

Thank you for your inputs. i am working through the suggestions you have given.

Ksharp
Super User

PROC ADAPTIVEREG and PROC GAMPL also can do logistic regression.

KavitaShah
Calcite | Level 5

Sure, thank you.

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 6 replies
  • 1972 views
  • 2 likes
  • 4 in conversation