I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
@Steelersgirl wrote:
I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
You can specify the statistics desired in the PROC MEANS statement.
You'll also likely want to capture the output into a data set.
PROC MEANS DATA=NFL N Mean MEdian MIN MAX STD; *list statistics here; WHERE position='qb'; *note that this is case sensitive; BY team; var varName; *list your variable to analyze here ; ods output summary = want; *store results; RUN;
Hope that helps.
What results does the code give?
qd is not a variable is it? it's a value so must be quoted.
@Steelersgirl wrote:
I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
You can specify the statistics desired in the PROC MEANS statement.
You'll also likely want to capture the output into a data set.
PROC MEANS DATA=NFL N Mean MEdian MIN MAX STD; *list statistics here; WHERE position='qb'; *note that this is case sensitive; BY team; var varName; *list your variable to analyze here ; ods output summary = want; *store results; RUN;
Hope that helps.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.