I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
@Steelersgirl wrote:
I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
You can specify the statistics desired in the PROC MEANS statement.
You'll also likely want to capture the output into a data set.
PROC MEANS DATA=NFL N Mean MEdian MIN MAX STD; *list statistics here; WHERE position='qb'; *note that this is case sensitive; BY team; var varName; *list your variable to analyze here ; ods output summary = want; *store results; RUN;
Hope that helps.
What results does the code give?
qd is not a variable is it? it's a value so must be quoted.
@Steelersgirl wrote:
I am looking at a data set about football players. After I read in the data set, I am trying to calculate the 5-number summary for the passYD variable for only players who position in qb. I want to do this separately for each value of team. This is the code I have been trying
DATA NFL;
FILENAME webpage URL 'http://people.stat.sc.edu/hitchcock/nfl_season_data.txt';
infile webpage DLM=',' DSD;
INPUT idcode $ lastname :$20. firstname :$20. year team $ position $ G GS COMP ATT
PassYD PassTD INT rush rushYD rushTD rec recYD recTD;
RUN;
PROC MEANS=PassYD;
WHERE position=qb;
BY team;
RUN;
You can specify the statistics desired in the PROC MEANS statement.
You'll also likely want to capture the output into a data set.
PROC MEANS DATA=NFL N Mean MEdian MIN MAX STD; *list statistics here; WHERE position='qb'; *note that this is case sensitive; BY team; var varName; *list your variable to analyze here ; ods output summary = want; *store results; RUN;
Hope that helps.
Available on demand!
Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Select SAS Training centers are offering in-person courses. View upcoming courses for: