Hi,

I'm working with a dataset that is a combination of two separate datasets. There are certain variables that are positively skewed. I have decided to winsorize to address this, but I wasn't sure if I should winsorize the variables from the different datasets separately or all together.

For example (winsorize at 75th percentile):

Dataset  Freckles  Winsorize_together_75  Winsorize_groups_separately_75
1        10        10                     10
1        15        15                     15
1        20        20                     20
1        99        75                     99
1        100       75                     99
1        10        10                     10
2        15        15                     15
2        20        20                     20
2        20        20                     20
2        25        25                     25
2        75        75                     55
2        105       75                     55
2        35        35                     35
2        35        35                     35

Should I winsorize the positively skewed variable for the overall dataset or for the two datasets separately?

Thanks!
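
In case it helps to make the two options concrete, here is a minimal Python sketch of what I mean by each one. It is only an illustration under some assumptions: the function names are made up for this post, only the upper tail is capped, and the percentile uses linear interpolation, which is just one of several common definitions. The caps it computes will therefore not match the illustrative numbers in my table above.

```python
def percentile(values, q):
    """Percentile by linear interpolation between order statistics (q in [0, 100]).
    Assumption: other percentile definitions exist and give slightly different caps."""
    xs = sorted(values)
    pos = (len(xs) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def winsorize_upper(values, q=75):
    """Option 1: cap every value at the q-th percentile of the pooled data."""
    cap = percentile(values, q)
    return [min(v, cap) for v in values]


def winsorize_by_group(groups, values, q=75):
    """Option 2: cap each value at the q-th percentile of its own source dataset."""
    caps = {
        g: percentile([v for gg, v in zip(groups, values) if gg == g], q)
        for g in set(groups)
    }
    return [min(v, caps[g]) for g, v in zip(groups, values)]


# The Dataset and Freckles columns from the example table
dataset = [1] * 6 + [2] * 8
freckles = [10, 15, 20, 99, 100, 10, 15, 20, 20, 25, 75, 105, 35, 35]

together = winsorize_upper(freckles)
separately = winsorize_by_group(dataset, freckles)

for d, f, t, s in zip(dataset, freckles, together, separately):
    print(d, f, t, s)
```

With the first option, both datasets share a single cap. With the second, each dataset gets its own cap, so the same raw value can be capped in one dataset and left alone in the other.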