Hi, I was performing Linear Regression which is based on E-Commerce Dataset. I was stuck with the following problem. Assumption: In the dataset, I am taking store_purchase_event_count as a Dependent variable for predicting store_purchase_event_count using OLS Linear Regression. Problem: I am trying to Normalize the Dependent variable but it contains more than 50-60% of zeroes. So, I was not able to figure out how I should move forward with this problem. Solutions tried: 1. Added constant to each value of Y and then taking the log. 2. Taking the square root of each value. None of the above solutions is making Y variable normal. Please suggest how to move forward
... View more