New Contributor
Posts: 3

# PROC UNIVARIATE output explanation

Hello,

I ran PROC UNIVARIATE to analyze the variable FEV for smokers and non-smokers. Here is the code I used:

`PROC MEANS DATA=pulfun; CLASS smoke; VAR fev; RUN; PROC SORT DATA=pulfun; BY smoke; RUN; PROC UNIVARIATE DATA=pulfun PLOT; BY smoke; VAR fev; RUN;`

The output is attached as a PDF file. My question is:

"Based on the output of PROC UNIVARIATE, describe the differences and similarities in the shapes of the FEV distributions for smokers and non-smokers.  Specifically identify the features of the output that you used."

Super User
Posts: 23,683

## Re: PROC UNIVARIATE output explanation

It's easier to make a comparison if the distributions are overlaid in some manner.

Try the HISTOGRAM statement.
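As a rough sketch of what that could look like, the step below uses a CLASS statement together with HISTOGRAM so that PROC UNIVARIATE draws one panel per level of `smoke` on a shared FEV axis. The data set and variable names are taken from the original post; the `NROWS=2` layout and the `NOPRINT` option are just one reasonable choice, not the only way to do it.

```sas
/* Comparative histograms of FEV: one panel per level of smoke.   */
/* CLASS replaces the BY statement, so no prior sort is required. */
PROC UNIVARIATE DATA=pulfun NOPRINT;   /* NOPRINT: skip the tables, keep the graph */
  CLASS smoke;
  VAR fev;
  HISTOGRAM fev / NROWS=2;             /* stack the two panels vertically */
RUN;
```

Stacking the panels on a common horizontal axis makes it easier to see whether one group is shifted or has a longer tail than the other.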

New Contributor
Posts: 3

## Re: PROC UNIVARIATE output explanation

So, could you answer my question, please? I am not clear on that concept.

Thank you!

Super User
Posts: 23,683

## Re: PROC UNIVARIATE output explanation

Distributions have specific features that are used for comparison:

- Measure of center, such as the median or mean
- Tails
- Outliers

You can compare the distributions using the histograms and/or the box plots. I find it easier to write up a comparison from a box plot.
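If the box plots in the PROC UNIVARIATE output are hard to compare because they appear on separate pages, one option is to draw them side by side. This is a minimal sketch using PROC SGPLOT, assuming the same `pulfun` data set with `fev` and `smoke` as in the original post:

```sas
/* Side-by-side box plots of FEV, one box per level of smoke */
PROC SGPLOT DATA=pulfun;
  VBOX fev / CATEGORY=smoke;
RUN;
```

Each box shows the median and quartiles, and observations beyond the whiskers are drawn as individual points, which covers center, tails, and outliers in one graph.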

So how do smokers and non-smokers compare on the median and mean of the FEV variable? Are the tails longer in one group than in the other? In a specific direction? What about outliers in the two groups?
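To put numbers next to those questions, the PROC MEANS step from the original post could be extended to request the median, quartiles, and skewness for each group. This is only a sketch; the particular statistics listed and the `MAXDEC=3` rounding are my own choices:

```sas
/* Center, spread, and skewness of FEV for each level of smoke */
PROC MEANS DATA=pulfun N MEAN MEDIAN Q1 Q3 MIN MAX SKEWNESS MAXDEC=3;
  CLASS smoke;
  VAR fev;
RUN;
```

A mean noticeably above the median, or a positive skewness, points toward a longer right tail; the reverse points toward a longer left tail.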

Khan Academy has a full chapter on this topic (Comparing features of a distribution). They do a better job of explaining this than I would.

This is clearly homework, so this is as much of an answer as I'm willing to provide (my personal rule). Others may (or may not) choose to provide more assistance. Good luck.

SAS Super FREQ
Posts: 4,239