Programming the statistical procedures from SAS

Analyzing zero-inflated near-normally distributed data

Reply
Contributor
Posts: 25

Analyzing zero-inflated near-normally distributed data

Hi all,

I have an outcome variable measuring change of numbers of unhealthy days across two time points. The data is highly zero-inflated with both negative and positive numbers . Does anyone know how to transform the data to make it normally distributed? Ideally I want to use linear regression model but there might be other models which deal better with non-parametric data like this.

Please feel free to throw in any ideas!

Thanks a lot!

Respected Advisor
Posts: 4,756

Re: Analyzing zero-inflated near-normally distributed data

You should look at the FMM procedure. Start with the introductory zero-inflation example:

http://support.sas.com/documentation/cdl/en/statug/67523/HTML/default/viewer.htm#statug_fmm_gettings...

PG

PG
Respected Advisor
Posts: 2,655

Re: Analyzing zero-inflated near-normally distributed data

Is the response you are looking at the difference between timepoint A and timepoint B (these might be intervals of equal length)?  If so, rather than looking at differences, could you treat the values as repeated measures, and then look at the difference in the least squares means.

The reason I suggest this is because I think an observation of 3 days for A and 3 days for B resulting in a zero is fundamentally different from an observation of 90 days for A and 90 days for B, which also results in a zero.

Steve Denham

Message was edited by: Steve Denham

Ask a Question
Discussion stats
  • 2 replies
  • 276 views
  • 0 likes
  • 3 in conversation