BookmarkSubscribeRSS Feed
cindyforest7
Calcite | Level 5

Hi all,

I have an outcome variable measuring change of numbers of unhealthy days across two time points. The data is highly zero-inflated with both negative and positive numbers . Does anyone know how to transform the data to make it normally distributed? Ideally I want to use linear regression model but there might be other models which deal better with non-parametric data like this.

Please feel free to throw in any ideas!

Thanks a lot!

2 REPLIES 2
PGStats
Opal | Level 21

You should look at the FMM procedure. Start with the introductory zero-inflation example:

http://support.sas.com/documentation/cdl/en/statug/67523/HTML/default/viewer.htm#statug_fmm_gettings...

PG

PG
SteveDenham
Jade | Level 19

Is the response you are looking at the difference between timepoint A and timepoint B (these might be intervals of equal length)?  If so, rather than looking at differences, could you treat the values as repeated measures, and then look at the difference in the least squares means.

The reason I suggest this is because I think an observation of 3 days for A and 3 days for B resulting in a zero is fundamentally different from an observation of 90 days for A and 90 days for B, which also results in a zero.

Steve Denham

Message was edited by: Steve Denham

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1323 views
  • 0 likes
  • 3 in conversation