SAS Data Science

okla · Posted 11-26-2017 11:07 PM

Hi,

I have the following distribution, and odd ratio for an event (taking up a certain product).

From my understanding of odd ratio, it should be interpreted as

1) Those with NoCreditCard is more likely to take up the product vs WithCreditCard_WithTxn

2) Those with WithCreditCard_WithTxn is more likely to take up the product vs WithCreditCard_NoTxn

but why is my actual event distribution (Observed data) show that those with "No Credit Card" has only 19% that takeup the product, which is the lowest of all..... Did i read the odd ratio wrongly?

	Base Distribution	Event
No Credit Card	34%	19%
WithCreditCard_NoTxn	55%	23%
WithCreditCard_WithTxn	11%	31%

	Point Estimate
NoCreditCard vs WithCC_WithTxn	1.196
WithCC_NoTxn vs WithCC_WithTxn	0.839

Thanks.

Mei.

Reeza · Posted 11-27-2017 11:51 PM

@okla wrote:

Q1) Why the Odd ratio for A with reference to male change? Shouldn't my Odd ratio for female still the same as (A) above? Since it is still comparing female and male (ie the odd of female to buy ice cream if increase by one unit)?

Think of it as a linear regression and adding a new variable, the parameters would change. Since the parameters have changed, the odds ratio changed.

Q2) Say (A) above is < 1, is it possible that once i added "TodayWeather", the odd ratio become > 1? Why?

Sounds like this:

https://en.wikipedia.org/wiki/Simpson%27s_paradox

View solution in original post

Reeza · Posted 11-27-2017 12:31 AM

What parameterization method did you use? I suggest including your code and the relevant output directly as well.

okla · Posted 11-27-2017 12:40 AM

Hi,

I am using EM 12.1, and this is the setting for logistic regression.

Reeza · Posted 11-27-2017 12:46 AM

What are your options for INPUT CODING ?

By default SAS uses GLM which is not what a standard text book teaches. I think it’s INPUT CODING but not 100% sure.

okla · Posted 11-27-2017 12:56 AM

Looks like it is input coding. I have changed the setting to use "GLM" instead of "Deviation" (Default), the odd ratio has changed to this

	Point Estimate
NoCreditCard vs WithCC_WithTxn	0.766
WithCC_NoTxn vs WithCC_WithTxn	0.839

Thank you. I must put some research on input coding. Not quite sure how it works.

Reeza · Posted 11-27-2017 01:03 AM

Referential/Ref is the most common. It’s basically creating dummy variables for your categorical variables.

okla · Posted 11-27-2017 11:19 PM

Hi Reeza,

Sorry, coming back to this, now that i understand the different on the referential/ref, i have got a question on odd ratio...

Say, i am trying to predict likelihood to buy ice cream (event =1 , non-event = 0) using logistic regression.

I have only one variable "Gender" (2 value= Male/Female), so, the formula would be like this.

logit(p)=β0+β1∗female

Odd ratio for female with reference to male (A) = odd(female)/odd(male)

Next, I added in "TodayWeather" (2 value = Sun/Rain).

logit(p)=β0+β1∗female+β2*SUN

Q1) Why the Odd ratio for A with reference to male change? Shouldn't my Odd ratio for female still the same as (A) above? Since it is still comparing female and male (ie the odd of female to buy ice cream if increase by one unit)?

Q2) Say (A) above is < 1, is it possible that once i added "TodayWeather", the odd ratio become > 1? Why?

Thank you again.

Reeza · Posted 11-27-2017 11:51 PM

@okla wrote:

Q1) Why the Odd ratio for A with reference to male change? Shouldn't my Odd ratio for female still the same as (A) above? Since it is still comparing female and male (ie the odd of female to buy ice cream if increase by one unit)?

Think of it as a linear regression and adding a new variable, the parameters would change. Since the parameters have changed, the odds ratio changed.

Q2) Say (A) above is < 1, is it possible that once i added "TodayWeather", the odd ratio become > 1? Why?

Sounds like this:

https://en.wikipedia.org/wiki/Simpson%27s_paradox

okla · Posted 11-28-2017 12:22 AM

Hi Reeza, thanks for your speedy reply. Really appreciate it. I think i found my answer. 🙂

AnnaBrown · Posted 11-28-2017 09:47 AM

Hi @okla,

I'm glad you found your answer! If one of the replies was the exact solution to your problem, can you "Accept it as a solution"? Or if one was particularly helpful, feel free to "Like" it. This will help other community members who may run into the same issue know what worked.

Thanks!
Anna

Access SAS Innovate on-demand content now!

SAS Data Science

Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Re: Odd Ratio Interpretation - with reference

Follow Us

What is...

SAS Data Science

Our biggest data and AI event of the year.

Follow Us

What is...