Hi all,
I have an outcome variable measuring change of numbers of unhealthy days across two time points. The data is highly zero-inflated with both negative and positive numbers . Does anyone know how to transform the data to make it normally distributed? Ideally I want to use linear regression model but there might be other models which deal better with non-parametric data like this.
Please feel free to throw in any ideas!
Thanks a lot!
You should look at the FMM procedure. Start with the introductory zero-inflation example:
PG
Is the response you are looking at the difference between timepoint A and timepoint B (these might be intervals of equal length)? If so, rather than looking at differences, could you treat the values as repeated measures, and then look at the difference in the least squares means.
The reason I suggest this is because I think an observation of 3 days for A and 3 days for B resulting in a zero is fundamentally different from an observation of 90 days for A and 90 days for B, which also results in a zero.
Steve Denham
Message was edited by: Steve Denham
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.
Find more tutorials on the SAS Users YouTube channel.