<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Zero Inflated Dependent variable in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/420028#M22257</link>
    <description>&lt;P&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I was performing Linear Regression which is based on E-Commerce Dataset. I was stuck with the following problem.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Assumption&lt;/STRONG&gt;: In&amp;nbsp;the dataset, I am taking&lt;STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;store_purchase_event_count&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;as a Dependent variable for predicting store_purchase_event_count using OLS Linear Regression.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Problem&lt;/STRONG&gt;: I am trying to Normalize the Dependent variable but it contains more than 50-60% of zeroes. So, I was not able to figure out how I should move forward with this problem.&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Solutions tried:&lt;/STRONG&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;1. Added constant to each value of Y and then taking the log.&lt;/DIV&gt;&lt;DIV&gt;2. Taking the square root of each value.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;None of the above solutions is making Y variable normal. Please suggest how to move forward&lt;/DIV&gt;</description>
    <pubDate>Mon, 11 Dec 2017 07:55:12 GMT</pubDate>
    <dc:creator>Harman</dc:creator>
    <dc:date>2017-12-11T07:55:12Z</dc:date>
    <item>
      <title>Zero Inflated Dependent variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/420028#M22257</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hi,&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;I was performing Linear Regression which is based on E-Commerce Dataset. I was stuck with the following problem.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Assumption&lt;/STRONG&gt;: In&amp;nbsp;the dataset, I am taking&lt;STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;store_purchase_event_count&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;as a Dependent variable for predicting store_purchase_event_count using OLS Linear Regression.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Problem&lt;/STRONG&gt;: I am trying to Normalize the Dependent variable but it contains more than 50-60% of zeroes. So, I was not able to figure out how I should move forward with this problem.&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;STRONG&gt;Solutions tried:&lt;/STRONG&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;1. Added constant to each value of Y and then taking the log.&lt;/DIV&gt;&lt;DIV&gt;2. Taking the square root of each value.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;None of the above solutions is making Y variable normal. Please suggest how to move forward&lt;/DIV&gt;</description>
      <pubDate>Mon, 11 Dec 2017 07:55:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/420028#M22257</guid>
      <dc:creator>Harman</dc:creator>
      <dc:date>2017-12-11T07:55:12Z</dc:date>
    </item>
    <item>
      <title>Re: Zero Inflated Dependent variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/422924#M22258</link>
      <description>&lt;P&gt;Moved question to "&amp;nbsp;SAS Statistical Procedures"&lt;/P&gt;</description>
      <pubDate>Thu, 21 Dec 2017 01:05:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/422924#M22258</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2017-12-21T01:05:55Z</dc:date>
    </item>
    <item>
      <title>Re: Zero Inflated Dependent variable</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/425012#M22354</link>
      <description>&lt;P&gt;I assume that your response is positively valued except for the zeros. If that is correct, and if the values are all integers (like a count: 0, 1, 2, 3, ...), then you can fit a zero-inflated Poisson or negative binomial model using PROC GENMOD. See the GENMOD documentation. If the response is positive and continuous, then you could try fitting a zero-inflated gamma model using PROC FMM - for example:&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data a; 
call streaminit(2342);
do i=1 to 100;
 y=rand("gamma",2);
 output;
end; 
do i=1 to 10; y=0; output; end;
run;

/* histogram of data */
proc sgplot data=a;
histogram y / showbins nbins=9;
run;

/* zero-inflated gamma model */
proc fmm data=a plots=density(nbins=9);
model y= / dist=gamma;
model + / dist=constant;
run;

&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 04 Jan 2018 18:23:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Zero-Inflated-Dependent-variable/m-p/425012#M22354</guid>
      <dc:creator>StatDave</dc:creator>
      <dc:date>2018-01-04T18:23:29Z</dc:date>
    </item>
  </channel>
</rss>

