06-20-2012 12:06 PM

Hi,

I am trying to calculate z-scores for the variables in my dataset using PROC STANDARD. The columns all have different means and standard deviations, so my question is: should I use a common mean and standard deviation when calculating the z-scores, or should I calculate them separately for each column?

In terms of code:

PROC STANDARD DATA = X MEAN = 0 STD = 1 OUT = ZSCORE;
    VAR A   /* it has a mean of 5 and std of 5 */
        B   /* it has a mean of 500 and std of 7 */
        C;  /* it has a mean of 900 and std of 1000 */
run;

Or should I use this approach?

PROC STANDARD DATA = X MEAN = 5 STD = 5 OUT = ZSCORE_a;
    VAR A;  /* it has a mean of 5 and std of 5 */
run;

PROC STANDARD DATA = X MEAN = 500 STD = 7 OUT = ZSCORE_b;
    VAR B;  /* it has a mean of 500 and std of 7 */
run;

PROC STANDARD DATA = X MEAN = 900 STD = 1000 OUT = ZSCORE_c;
    VAR C;  /* it has a mean of 900 and std of 1000 */
run;

and then merge all the columns?

I really appreciate your time and guidance.

Thanks!

06-20-2012 01:48 PM

See my reply in the Statistical Procedures forum. The first code block is what you want to use.

Steve Denham
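
For anyone who finds this thread without the other reply: the reason the first block works is that MEAN= and STD= in PROC STANDARD are the target values for the output, not the current values of the input. Each variable in the VAR statement is standardized separately, using its own sample mean and standard deviation. By the same logic, MEAN=5 STD=5 would rescale A to a mean of 5 and a std of 5, which leaves it essentially unchanged if those are already its sample statistics. Below is a minimal sketch to check this; the data values are made up for illustration, and only the dataset and variable names come from the question.

```sas
/* Hypothetical sample data, for illustration only */
data X;
    input A B C;
    datalines;
1 495 100
5 500 900
9 505 1700
;
run;

/* MEAN= and STD= are the values the output should have.      */
/* Each variable is centered and scaled with its own sample   */
/* mean and standard deviation.                               */
proc standard data=X mean=0 std=1 out=ZSCORE;
    var A B C;
run;

/* Check: every standardized column should show a mean of 0   */
/* and a standard deviation of 1 (up to rounding).            */
proc means data=ZSCORE mean std;
    var A B C;
run;
```

Since one call handles all three columns, there is no need to run the procedure once per variable and merge the results afterwards.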