Hello;
I coded a proc univariate to analyze the variable FEV for smokers and non-smokers. Let me write the code here and the output as well.
PROC MEANS DATA=pulfun;
CLASS smoke;
VAR fev;
RUN;
PROC SORT DATA=pulfun;
BY smoke;
RUN;
PROC UNIVARIATE DATA=pulfun PLOT;
By smoke;
VAR fev;
And, I got the output attached as PDF file. My question is:
"Based on the output of PROC UNIVARIATE, describe the differences and similarities in the shapes of the FEV distributions for smokers and non-smokers. Specifically identify the features of the output that you used."
Help me please. Thank you!
It's easier to make a comparison if they're overlayed in some manner.
Try the histogram statement.
So, could you answer my question please? I am not clear with that concept. I am waiting.
Thank you!
Distributions have specific features that are used for comparison:
measure of center such as median/mean
Tails
Outliers
You can compare the distributions using the graphs and/or the box plot. I find a box plot easier to write up a comparison.
So how do smokers compare for the median and mean for the FEV variable? Are the tails longer in one than the other? In a specific direction? What about outliers between the two?
Khan Academy has a full chapter on this topic (Comparing features of a distribution). They do a better job of explaining this than I would.
This is clearly homework so this as much of an answer I'm willing to provide (my personal rule). Others may (or may not) choose to provide more assistance. Good Luck.
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.