<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: proc logistic with too many zeroes in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582119#M75680</link>
    <description>So HostA and Bacteria A are always the same? And all others are zero?</description>
    <pubDate>Mon, 19 Aug 2019 14:51:43 GMT</pubDate>
    <dc:creator>Reeza</dc:creator>
    <dc:date>2019-08-19T14:51:43Z</dc:date>
    <item>
      <title>proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581959#M75659</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;I need your advice on what type of model to use for my data that is not behaving according to planned.&lt;BR /&gt;The study is about bacterial introduction into several hosts (experiments performed on flies).&lt;/P&gt;
&lt;P&gt;The data was collected as "0" (fail-to-introduce) and "1" (introduced). The logistic regression (binomial distribution) was run on the data.&lt;BR /&gt;The bacteria and hosts were treated as class variables with 5 categories each. The model includes the two variables and their interaction.&lt;BR /&gt;However, one of the many problems of the analysis is that the logistic output shows that the model may not a good fit for the data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Deviance and Pearson Goodness-of-Fit Statistics &lt;BR /&gt;Criterion&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Value&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; DF&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Value/DF&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Pr &amp;gt; ChiSq &lt;BR /&gt;Deviance&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 256.2352&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 200 &amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1.2812&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.0044 &lt;BR /&gt;Pearson&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 223.6127&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 200 &amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1.1181&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0.1210&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The diagnostic plots show many points that are not fitting in the model:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SasCommuties_diag1.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31829iDED08010A206E409/image-size/large?v=v2&amp;amp;px=999" role="button" title="SasCommuties_diag1.png" alt="SasCommuties_diag1.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SasCommuties_diag2.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31830iEDFB72B4E5FA7F5C/image-size/large?v=v2&amp;amp;px=999" role="button" title="SasCommuties_diag2.png" alt="SasCommuties_diag2.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SasCommuties_diag3.png" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31831iF20D5A76016EC40A/image-size/large?v=v2&amp;amp;px=999" role="button" title="SasCommuties_diag3.png" alt="SasCommuties_diag3.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I checked the distribution of frequencies of the response and I see that there is a high percentage of zeroes. It seems that this is the likely origin of the problem withe the binomial distribution I used with the logistic regression&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="SAS_Communities_1.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31832i22845E81E212511F/image-size/large?v=v2&amp;amp;px=999" role="button" title="SAS_Communities_1.jpg" alt="SAS_Communities_1.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Can anyone suggest me how to deal with this type of data?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you in advance.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 17 Aug 2019 19:11:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581959#M75659</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-17T19:11:42Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581963#M75660</link>
      <description>You only have categorical data so you should expect clusters in your data because the fit stats are expecting to have some continuous variables. Instead of PROC LOGISTIC you could consider trying a categorical procedure such as CATMOD. &lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sat, 17 Aug 2019 19:21:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581963#M75660</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-08-17T19:21:31Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581966#M75661</link>
      <description>Thanks Reeza,&lt;BR /&gt;I will check it out right now.&lt;BR /&gt;</description>
      <pubDate>Sat, 17 Aug 2019 19:41:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581966#M75661</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-17T19:41:31Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581968#M75662</link>
      <description>Hi Reeza,&lt;BR /&gt;Just one more thing. I have continuous variables that I originally included in my logistic regression, but removed it after model selection. Even when I put back the continuous variables in the model the problem persists.&lt;BR /&gt;Any other suggestions are welcome</description>
      <pubDate>Sat, 17 Aug 2019 20:03:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581968#M75662</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-17T20:03:39Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581974#M75663</link>
      <description>Of course it does, because you still have categorical data as a part of your model, so you have to expect clumps in the outputs. You only have so many values to check and they'll often be the same.</description>
      <pubDate>Sat, 17 Aug 2019 22:41:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581974#M75663</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-08-17T22:41:15Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581977#M75664</link>
      <description>Thank you for your comments. I just checked the logistic with continuous variables ONLY. It does not solve the problem.&lt;BR /&gt;With categorical variable only, I had a quasi separation the warnings:&lt;BR /&gt;&lt;BR /&gt;Quasi-complete separation of data points detected. &lt;BR /&gt;Warning: The maximum likelihood estimate may not exist. &lt;BR /&gt;Warning: The LOGISTIC procedure continues in spite of the above warning. Results shown are based on the last maximum likelihood iteration. Validity of the model fit is questionable. &lt;BR /&gt;&lt;BR /&gt;The Firth correction made the warning go away.&lt;BR /&gt;</description>
      <pubDate>Sat, 17 Aug 2019 23:41:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581977#M75664</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-17T23:41:41Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581983#M75665</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/48359"&gt;@igforek&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;Thank you for your comments. I just checked the logistic with continuous variables ONLY. It does not solve the problem.&lt;BR /&gt;With categorical variable only, I had a quasi separation the warnings:&lt;BR /&gt;&lt;BR /&gt;Quasi-complete separation of data points detected. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You had not mentioned that before.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How did you create your categories and categorical values? Did you check them against your outcome variable using PROC FREQ?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It seems like you have a few that overlap - ie one level in one group matches all the records of one level in another group. You're looking for lines of zeros or near zeros in your proc freq except for one column/row. Those are the categories that are causing you issues.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 03:18:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/581983#M75665</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-08-18T03:18:36Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582004#M75668</link>
      <description>&lt;P&gt;It is called unbalanced data problem.&lt;/P&gt;
&lt;P&gt;Try oversampling ,lift up 0:1 percent be&amp;nbsp; 1:1 or 5:1 , otherwise LOGISTIC model have low power .&lt;/P&gt;
&lt;P&gt;and use OFFSET= option to adjust the prediction proability .&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 11:40:36 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582004#M75668</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-08-18T11:40:36Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582007#M75669</link>
      <description>&lt;P&gt;Below is a condensed matrix of the data.&lt;/P&gt;
&lt;P&gt;Each host has a natural bacteria. The bacteria were introduced into new hosts as well as into their native host ("self-introduced"). The diagonal shows the results of "self-introduction"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;TABLE style="border-collapse: collapse; width: 288pt;" width="384" cellspacing="0" cellpadding="0" border="0"&gt;
&lt;TBODY&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt; width: 48pt;" width="64" height="20"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostA&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostB&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostC&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostD&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostE&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactA&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl68"&gt;0.01&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactB&lt;/TD&gt;
&lt;TD class="xl67"&gt;0.00&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl69"&gt;0.10&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactC&lt;/TD&gt;
&lt;TD class="xl67"&gt;0.00&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl69"&gt;0.10&lt;/TD&gt;
&lt;TD class="xl67"&gt;0.00&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactD&lt;/TD&gt;
&lt;TD class="xl67"&gt;0.00&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactE&lt;/TD&gt;
&lt;TD class="xl67"&gt;0.00&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl68"&gt;0.01&lt;/TD&gt;
&lt;TD class="xl66"&gt;#&lt;/TD&gt;
&lt;TD class="xl66"&gt;0.00&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The symbol "#" represents proportion (of infected&amp;nbsp; individuals) that are different from each other in the whole matrix.&lt;/P&gt;
&lt;P&gt;There are two cells with value 0.01 that are equal.&lt;/P&gt;
&lt;P&gt;There are two cells with value 0.10 that are equal.&lt;/P&gt;
&lt;P&gt;The Zeroes are real data.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I tried to run a model with:&lt;/P&gt;
&lt;P&gt;PropInf = Bact + Host +Bact*Host&amp;nbsp;&amp;nbsp; in Genmod but I get the warning:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;"WARNING: Negative of Hessian not positive definite"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I think he many zeroes from HostA may be causing such a problem.&lt;/P&gt;
&lt;P&gt;The model PropInf = Bact + Host ran ok in genmod.&lt;/P&gt;
&lt;DIV class="branch"&gt;
&lt;DIV&gt;
&lt;DIV align="center"&gt;
&lt;TABLE class="table" summary="Procedure Genmod: Convergence Status" frame="box" rules="all" cellspacing="0" cellpadding="5"&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD class="l data"&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 14:34:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582007#M75669</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-18T14:34:10Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582008#M75670</link>
      <description>&lt;P&gt;I am using HostA and BactA as the Reference categories, with GLM coding.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 14:35:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582008#M75670</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-18T14:35:31Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582025#M75671</link>
      <description>&lt;P&gt;This is the frequency of response values per each bacteria and host:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Post_to_SAS_communities_freqbactHost.jpg" style="width: 600px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/31833i15145FE04FEE5C03/image-size/large?v=v2&amp;amp;px=999" role="button" title="Post_to_SAS_communities_freqbactHost.jpg" alt="Post_to_SAS_communities_freqbactHost.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 20:11:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582025#M75671</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-18T20:11:01Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582039#M75676</link>
      <description>&lt;P&gt;The measures of pairwise relatedness between bacteria are represented by the matrix below:&lt;/P&gt;
&lt;TABLE style="border-collapse: collapse; width: 288pt;" width="384" cellspacing="0" cellpadding="0" border="0"&gt;
&lt;TBODY&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt; width: 48pt;" width="64" height="20"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;BactA&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;BactB&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;BactC&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;BactD&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;BactE&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactA&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactB&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactC&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactD&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;BactE&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There are some zeroes outside the diagonal&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The measures of pairwise relatedness between hosts are represented by the matrix below:&lt;/P&gt;
&lt;TABLE style="border-collapse: collapse; width: 288pt;" width="384" cellspacing="0" cellpadding="0" border="0"&gt;
&lt;TBODY&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt; width: 48pt;" width="64" height="20"&gt;&amp;nbsp;&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostA&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostB&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostC&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostD&lt;/TD&gt;
&lt;TD class="xl65" style="width: 48pt;" width="64"&gt;HostE&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;HostA&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;HostB&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;HostC&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;HostD&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR style="height: 15.0pt;"&gt;
&lt;TD class="xl65" style="height: 15.0pt;" height="20"&gt;HostE&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;#&lt;/TD&gt;
&lt;TD class="xl65"&gt;0&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;No zeroes outside the diagonal.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;These measures of relatedness were used as continuous variables, but the logistic stepwise selection kept them out form the final model.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 02:00:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582039#M75676</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T02:00:57Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582119#M75680</link>
      <description>So HostA and Bacteria A are always the same? And all others are zero?</description>
      <pubDate>Mon, 19 Aug 2019 14:51:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582119#M75680</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-08-19T14:51:43Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582126#M75684</link>
      <description>All bacteria (except BactA) "Fail" to be introduced into HostA. The proportion of success for the introduction of BactB, BactC, BactD and BactE into HostA is zero. The natural bacteria of HostA, named BactA in the table, is the only one that it is "introduced" successfully (The symbol "#" represents a proportion different from Zero).</description>
      <pubDate>Mon, 19 Aug 2019 15:02:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582126#M75684</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T15:02:34Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582136#M75685</link>
      <description>BactA is successfully introduced into other hosts (HostB, HostC, HostD, HostE) with various proportions of success.</description>
      <pubDate>Mon, 19 Aug 2019 15:41:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582136#M75685</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T15:41:46Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582140#M75686</link>
      <description>I suspect that's the one that's causing the quasi separation issue - because it does separate the data quite cleanly.</description>
      <pubDate>Mon, 19 Aug 2019 15:47:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582140#M75686</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-08-19T15:47:10Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582153#M75687</link>
      <description>&lt;P&gt;It may be so.&lt;BR /&gt;Also, when I try to run genmod with the factors Bact Host and their interaction, the warning about the Hessian matrix not being positive definite may be caused by that many zeroes.&lt;BR /&gt;The model runs ok with no interaction.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 16:06:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582153#M75687</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T16:06:16Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582156#M75688</link>
      <description>&lt;P&gt;I took a look at this page: &lt;A href="https://support.sas.com/rnd/app/stat/examples/GENMODZIP/roots.htm" target="_blank" rel="noopener"&gt;https://support.sas.com/rnd/app/stat/examples/GENMODZIP/roots.htm&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;but I am not sure if this will apply to my data, because I have categorical independent variables. And binomial.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 16:13:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582156#M75688</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T16:13:33Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582203#M75691</link>
      <description>HostB is naturally infected by BactB&lt;BR /&gt;HostC is naturally infected by BactC&lt;BR /&gt;and so on&lt;BR /&gt;BactE is naturally infected by BactE</description>
      <pubDate>Mon, 19 Aug 2019 18:18:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582203#M75691</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-19T18:18:29Z</dc:date>
    </item>
    <item>
      <title>Re: proc logistic with too many zeroes</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582833#M75701</link>
      <description>&lt;P&gt;Mr. &lt;SPAN class="UserName lia-user-name lia-user-rank-Grand-Advisor"&gt;&lt;A id="link_8" class="lia-link-navigation lia-page-link lia-user-name-link" style="color: #007dc3;" href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408" target="_self"&gt;&lt;SPAN class="login-bold"&gt;Ksharp,&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="UserName lia-user-name lia-user-rank-Grand-Advisor"&gt;&lt;SPAN class="login-bold"&gt;I thought that unbalanced data meant that there were too many observations in one class, as compared to another class in an analysis.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="UserName lia-user-name lia-user-rank-Grand-Advisor"&gt;&lt;SPAN class="login-bold"&gt;Are you suggesting that too many zeros in one class can be a special case of unbalanced data?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN class="UserName lia-user-name lia-user-rank-Grand-Advisor"&gt;&lt;SPAN class="login-bold"&gt;Regards,&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Aug 2019 14:52:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/proc-logistic-with-too-many-zeroes/m-p/582833#M75701</guid>
      <dc:creator>igforek</dc:creator>
      <dc:date>2019-08-21T14:52:43Z</dc:date>
    </item>
  </channel>
</rss>

