Calcite | Level 5

## Proc Univariate help for a newbie

I'm working with a very large dataset and it has a MONTH variable from 1-12,

and also a VALUE variable, with amounts possible for each month.

I would like to determine mean, median, etc. for a yearly (annual) amount,

summing each person's (observation) for 2 months of values.

What's the easiest way for a noob to do this?

1 ACCEPTED SOLUTION

Accepted Solutions
Super User

## Re: Proc Univariate help for a newbie

@mikeed wrote:

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?

Yes, this is a two step problem so you can simply apply your proc twice.

The first time you're summing for the totals and in the second you're generating your summary statistics of the total value.

You can modify the statistics you get and the summary based on the statistics you specify in the PROC MEANS/SUMMARY statements.

You've been provided with multiple samples on how to run it for one, so you should be able to expand it to two sets of data.

But regardless, here's one way:

20 REPLIES 20
Diamond | Level 26

## Re: Proc Univariate help for a newbie

I would like to determine mean, median, etc. for a yearly (annual) amount,

summing each person's (observation) for 2 months of values.

I''m afraid I'm not able to comprehend this part of the request. The top line makes perfect sense, but not in combination with the second line, which is rather cryptic.

Please show us a small amount of this data, and explain what results you'd like from this small amount of data.

--
Paige Miller
Calcite | Level 5

## Re: Proc Univariate help for a newbie

sorry, typo. all 12 months.

each person has a unique ID and MONTHCODE variable from 1-12.

I want to sum the VALUEs of MONTHCODE1-12 for each person, then determine mean/median annual VALUE.

Diamond | Level 26

## Re: Proc Univariate help for a newbie

I tend to use PROC SUMMARY for this, UNIVARIATE would also work but the code would be different

``````/* UNTESTED CODE */

proc summary data=have;
class id;
var value;
output out=want mean=meanvalue median=medianvalue;
run;``````

--
Paige Miller
Calcite | Level 5

## Re: Proc Univariate help for a newbie

thanks,

would you be able to provide an example of the proc univariate code I would use?

Diamond | Level 26

## Re: Proc Univariate help for a newbie

As I haven't used UNIVARIATE in years, my answer is that I can't, off the top of my head, provide UNIVARIATE code. It probably isn't much different, however you can read the documentation for PROC UNIVARIATE and see if you can figure it out.

The other problem is that UNIVARIATE is computing a huge amount of statistics that you haven't requested, and depending on much data you have, this could slow things down dramatically and produce a huge long output file.

--
Paige Miller
Super User

## Re: Proc Univariate help for a newbie

Proc means, summary and univariate use similar processes so either should work for your request.

Why is PROC UNIVARIATE 'required'?

Here's a fully worked example of getting summary statistics using PROC MEAN:

https://github.com/statgeek/SAS-Tutorials/blob/master/proc_means_basic.sas

Calcite | Level 5

## Re: Proc Univariate help for a newbie

I don't need to use Univariate, but I'm still having difficulty trying to figure out

how to determine the annual values of the summary statistics

Help still requested.

Diamond | Level 26

## Re: Proc Univariate help for a newbie

@mikeed wrote:

I don't need to use Univariate, but I'm still having difficulty trying to figure out

how to determine the annual values of the summary statistics

Help still requested.

If the code I gave is not working properly for you, then please explain what is happening that is wrong, and show us the SASLOG and results. Otherwise, I assume the problem has been solved.

--
Paige Miller
Calcite | Level 5

## Re: Proc Univariate help for a newbie

ID MONTH VALUE

1   1             65

1   2             17

.    .               .

1   11           47

1   12           99

2   1             55

2   2             98

.    .               .

2   11            45

2   12            18

3

...

Super User

## Re: Proc Univariate help for a newbie

@mikeed wrote:

ID MONTH VALUE

1   1             65

1   2             17

.    .               .

1   11           47

1   12           99

2   1             55

2   2             98

.    .               .

2   11            45

2   12            18

3

...

NOW provide what the output is supposed to look like for that input.

Calcite | Level 5

## Re: Proc Univariate help for a newbie

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?

Diamond | Level 26

## Re: Proc Univariate help for a newbie

@mikeed wrote:

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?

I provided code to do this in a single procedure already in the thread. Why do you discuss this as if it there is no such code?

--
Paige Miller
Calcite | Level 5

## Re: Proc Univariate help for a newbie

I'm sorry, your code was not detailed enough and did not work since it did not help me

tally the MONTHs that I needed to find the statistics for.

Diamond | Level 26

## Re: Proc Univariate help for a newbie

You can't just say "it didn't work". You have to give us details. You have to show us the SASLOG and the data set created, and explain why this is not the proper result.

--
Paige Miller
Discussion stats
• 20 replies
• 1161 views
• 3 likes
• 4 in conversation