turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- Home
- /
- Analytics
- /
- Data Mining
- /
- Correlation between interval variables and binary...

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

07-09-2014 12:54 PM

Hi

Could please helmp me on a problem

Which of the following correlation method listed in table analysis node is the right one to test correlation between a continuous variable and a binary variable....

Continuity Adj. Chi-Square |

Mantel-Haenszel Chi-Square |

Phi Coefficient |

Contingency Coefficient |

Cramer's V |

Accepted Solutions

Solution

07-07-2017
01:25 PM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

07-07-2017 01:25 PM

The type of correlation you are describing is often referred to as a biserial correlation. There are 3 different types of biserial correlations--biserial, point biserial, and rank biserial. Each of these 3 types of biserial correlations are described in SAS Note 22925. I suspect you need to compute either the biserial or the point biserial correlation. The difference between these two, as described in the aforementioned SAS Note, depends on the binary variable. If the binary variable has an underlying continuous distribution, but is measured as binary, then you should compute a "biserial correlation." If the binary variable is truly dichotomous, then a "point biserial correlation" should be used. (The "rank biserial correlation" measures the relationship between a binary variable and a rankings (ie. ordinal) variable.)

If your binary variables are truly dichotomous (as opposed to discretized continuous variables), then you can compute the point biserial correlations directly in PROC CORR. The point biserial correlation is equivalent to the Pearson product moment correlation between two variables where the dichotomous variable is given any two numeric values. This information is also mentioned in our FASTats link under Correlation> Point Biserial. PROC CORR prints the Pearson product moment correlation by default, so no additional options are required.

If your binary variables are dichotomized continuous variables, then you will need to compute biserial correlations between each of these binary variables and your continuous variable. These correlations are only available through our %BISERIAL macro. SAS Note 24991 describes this macro and includes the source code for the macro in the Downloads tab.

All Replies

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

07-09-2014 01:33 PM

Hey Omer,

Here a great resource that summarizes statistical tests and how to code them in SAS.

Choosing the Correct Statistical Test in SAS, Stata and SPSS

I hope it helps,

Miguel

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

07-10-2014 02:30 AM

miguel this is truly a perfect guide for my problem

thank you very much for sharing the page with me...

Solution

07-07-2017
01:25 PM

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

07-07-2017 01:25 PM

The type of correlation you are describing is often referred to as a biserial correlation. There are 3 different types of biserial correlations--biserial, point biserial, and rank biserial. Each of these 3 types of biserial correlations are described in SAS Note 22925. I suspect you need to compute either the biserial or the point biserial correlation. The difference between these two, as described in the aforementioned SAS Note, depends on the binary variable. If the binary variable has an underlying continuous distribution, but is measured as binary, then you should compute a "biserial correlation." If the binary variable is truly dichotomous, then a "point biserial correlation" should be used. (The "rank biserial correlation" measures the relationship between a binary variable and a rankings (ie. ordinal) variable.)

If your binary variables are truly dichotomous (as opposed to discretized continuous variables), then you can compute the point biserial correlations directly in PROC CORR. The point biserial correlation is equivalent to the Pearson product moment correlation between two variables where the dichotomous variable is given any two numeric values. This information is also mentioned in our FASTats link under Correlation> Point Biserial. PROC CORR prints the Pearson product moment correlation by default, so no additional options are required.

If your binary variables are dichotomized continuous variables, then you will need to compute biserial correlations between each of these binary variables and your continuous variable. These correlations are only available through our %BISERIAL macro. SAS Note 24991 describes this macro and includes the source code for the macro in the Downloads tab.