<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data point in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482020#M25063</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a model y (disease status with 0 or 1) = sex + treatment, but there is no events for female at treatment B as below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Disease&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Treatment A&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Treatment B&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Female&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;1&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;0&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;0&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;8&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;18&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Male&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;9&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;5&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;0&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;87&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;86&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Below code was used:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;proc&lt;/STRONG&gt; &lt;STRONG&gt;logistic&lt;/STRONG&gt; data=xxx ;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;Class sex treatment ;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;model disease(event="1") = treatment sex sex*treatment;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;oddsratio treatmen/diff=ref;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I got warning message from SAS:&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;WARNING: There is possibly a quasi-complete separation of data points. The maximum likelihood&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; estimate may not exist.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;WARNING: The LOGISTIC procedure continues in spite of the above warning. Results shown are based&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on the last maximum likelihood iteration. Validity of the model fit is questionable.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Meanwhile SAS reported ORs (95% CI):&lt;/P&gt;&lt;P&gt;Female: &lt;FONT color="#0000ff"&gt;&amp;lt;0.001 (&amp;lt;0.001 &amp;gt;999.999)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Male: &lt;FONT color="#0000ff"&gt;1.55(0.48,5.01)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I understand OR and CI are not making sense for Female, but how about OR and CI for Male, whether it is still making sense to interpret it? Any suggestion for this situation if I do want to keep both factors in the model?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much for your help!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Nancy&lt;/P&gt;</description>
    <pubDate>Fri, 27 Jul 2018 19:53:55 GMT</pubDate>
    <dc:creator>Nancych</dc:creator>
    <dc:date>2018-07-27T19:53:55Z</dc:date>
    <item>
      <title>Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data point</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482020#M25063</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have a model y (disease status with 0 or 1) = sex + treatment, but there is no events for female at treatment B as below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Disease&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Treatment A&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;Treatment B&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Female&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;1&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;0&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;0&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;8&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;18&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;Male&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;1&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;9&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;5&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;0&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;87&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;86&lt;/FONT&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Below code was used:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;proc&lt;/STRONG&gt; &lt;STRONG&gt;logistic&lt;/STRONG&gt; data=xxx ;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;Class sex treatment ;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;model disease(event="1") = treatment sex sex*treatment;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;oddsratio treatmen/diff=ref;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&lt;STRONG&gt;run&lt;/STRONG&gt;;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I got warning message from SAS:&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;WARNING: There is possibly a quasi-complete separation of data points. The maximum likelihood&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; estimate may not exist.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;WARNING: The LOGISTIC procedure continues in spite of the above warning. Results shown are based&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#0000ff"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on the last maximum likelihood iteration. Validity of the model fit is questionable.&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Meanwhile SAS reported ORs (95% CI):&lt;/P&gt;&lt;P&gt;Female: &lt;FONT color="#0000ff"&gt;&amp;lt;0.001 (&amp;lt;0.001 &amp;gt;999.999)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Male: &lt;FONT color="#0000ff"&gt;1.55(0.48,5.01)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I understand OR and CI are not making sense for Female, but how about OR and CI for Male, whether it is still making sense to interpret it? Any suggestion for this situation if I do want to keep both factors in the model?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you very much for your help!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Nancy&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jul 2018 19:53:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482020#M25063</guid>
      <dc:creator>Nancych</dc:creator>
      <dc:date>2018-07-27T19:53:55Z</dc:date>
    </item>
    <item>
      <title>Re: Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data po</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482024#M25064</link>
      <description>&lt;P&gt;Yes, you can still use the estimates for the male variable.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can also look at exact logistic regression as well, and/or adding a small value to the B to see what happens - the estimates shouldn't change.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I don't remember the exact reason why, but I remember looking into in (years back) and seeing that it was valid. It had to do with it being to small to affect results in the end.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jul 2018 20:06:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482024#M25064</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2018-07-27T20:06:34Z</dc:date>
    </item>
    <item>
      <title>Re: Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data po</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482168#M25066</link>
      <description>&lt;P&gt;Did you try PROC CATMOD ?&lt;/P&gt;</description>
      <pubDate>Sat, 28 Jul 2018 11:09:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482168#M25066</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2018-07-28T11:09:57Z</dc:date>
    </item>
    <item>
      <title>Re: Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data po</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482184#M25068</link>
      <description>&lt;P&gt;using your code, those cell counts generate slightly different results than you report. The treatment odds ratio for:&lt;/P&gt;
&lt;P&gt;female: &amp;gt;999.999&lt;/P&gt;
&lt;P&gt;male: 1.779 (0.573, 5.525)&lt;/P&gt;
&lt;P&gt;When you have separation problems due to sparseness like this, a useful alternative is to use Firth's penalized likelihood method FIRTH option, which gives:&lt;/P&gt;
&lt;P&gt;female: 6.529 (0.214, 198.994)&lt;/P&gt;
&lt;P&gt;male: 1.708 (0.570, 5.116)&lt;/P&gt;
&lt;P&gt;Or, for fairly small sample problems like this, use the exact method:&lt;/P&gt;
&lt;P&gt;female: 2.000 (0.105, infinity)&lt;/P&gt;
&lt;P&gt;male: 1.774 (0.509, 7.023)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Notice that the treatment odds ratio for males doesn't change much.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The Firth and exact results both provided by this code (remove firth option to see the regular, asymptotic results):&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data a;
do sex='f','m';
do disease=1,0;
do trt='a','b';
input count @@;
output;
end; end; end;
datalines;
1 0
8 18
9 5
87 86
;
proc logistic;
freq count;
class sex trt/param=ref;
model disease(event="1") = sex trt(sex)/firth;
oddsratio trt/diff=ref;
exact trt(sex)/estimate=both;
run;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Sat, 28 Jul 2018 16:05:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482184#M25068</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2018-07-28T16:05:55Z</dc:date>
    </item>
    <item>
      <title>Re: Odds Ratio from subgroup analysis using logistic model with quasi-complete separation of data po</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482220#M25071</link>
      <description>Thank you, everyone.&lt;BR /&gt;Your suggestions are very very helpful! I might keep as is or using Firth correction.&lt;BR /&gt;Thanks again,&lt;BR /&gt;Nancy</description>
      <pubDate>Sun, 29 Jul 2018 01:39:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Odds-Ratio-from-subgroup-analysis-using-logistic-model-with/m-p/482220#M25071</guid>
      <dc:creator>Nancych</dc:creator>
      <dc:date>2018-07-29T01:39:46Z</dc:date>
    </item>
  </channel>
</rss>

