BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
mikeed
Calcite | Level 5

I'm working with a very large dataset and it has a MONTH variable from 1-12,

and also a VALUE variable, with amounts possible for each month.

 

I would like to determine mean, median, etc. for a yearly (annual) amount,

summing each person's (observation) for 2 months of values.

 

What's the easiest way for a noob to do this?

Thanks in advance...

1 ACCEPTED SOLUTION

Accepted Solutions
Reeza
Super User

@mikeed wrote:

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

 

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?


Yes, this is a two step problem so you can simply apply your proc twice. 

The first time you're summing for the totals and in the second you're generating your summary statistics of the total value. 

 

You can modify the statistics you get and the summary based on the statistics you specify in the PROC MEANS/SUMMARY statements. 

You've been provided with multiple samples on how to run it for one, so you should be able to expand it to two sets of data. 

But regardless, here's one way:

 

Screen Shot 2018-04-14 at 12.30.32 PM.png

 

Screen Shot 2018-04-14 at 12.30.52 PM.png

 

View solution in original post

20 REPLIES 20
PaigeMiller
Diamond | Level 26

I would like to determine mean, median, etc. for a yearly (annual) amount,

summing each person's (observation) for 2 months of values.

 

I''m afraid I'm not able to comprehend this part of the request. The top line makes perfect sense, but not in combination with the second line, which is rather cryptic.

 

Please show us a small amount of this data, and explain what results you'd like from this small amount of data.

--
Paige Miller
mikeed
Calcite | Level 5

sorry, typo. all 12 months.

 

each person has a unique ID and MONTHCODE variable from 1-12.

I want to sum the VALUEs of MONTHCODE1-12 for each person, then determine mean/median annual VALUE.

 

PaigeMiller
Diamond | Level 26

I tend to use PROC SUMMARY for this, UNIVARIATE would also work but the code would be different

 

/* UNTESTED CODE */

proc summary data=have;
    class id;
    var value;
    output out=want mean=meanvalue median=medianvalue;
run;

  

--
Paige Miller
mikeed
Calcite | Level 5

thanks,

would you be able to provide an example of the proc univariate code I would use?

PaigeMiller
Diamond | Level 26

As I haven't used UNIVARIATE in years, my answer is that I can't, off the top of my head, provide UNIVARIATE code. It probably isn't much different, however you can read the documentation for PROC UNIVARIATE and see if you can figure it out. 

 

The other problem is that UNIVARIATE is computing a huge amount of statistics that you haven't requested, and depending on much data you have, this could slow things down dramatically and produce a huge long output file.

--
Paige Miller
Reeza
Super User

Proc means, summary and univariate use similar processes so either should work for your request. 

Why is PROC UNIVARIATE 'required'? 

 

Here's a fully worked example of getting summary statistics using PROC MEAN:

https://github.com/statgeek/SAS-Tutorials/blob/master/proc_means_basic.sas

 

 

mikeed
Calcite | Level 5

I don't need to use Univariate, but I'm still having difficulty trying to figure out

how to determine the annual values of the summary statistics  Smiley Sad 

Help still requested.

PaigeMiller
Diamond | Level 26

@mikeed wrote:

I don't need to use Univariate, but I'm still having difficulty trying to figure out

how to determine the annual values of the summary statistics  Smiley Sad 

Help still requested.


If the code I gave is not working properly for you, then please explain what is happening that is wrong, and show us the SASLOG and results. Otherwise, I assume the problem has been solved.

--
Paige Miller
mikeed
Calcite | Level 5

ID MONTH VALUE

1   1             65

1   2             17

 

.    .               .

1   11           47

1   12           99  

 

2   1             55

2   2             98

 .    .               .

2   11            45

2   12            18  

3

...

ballardw
Super User

@mikeed wrote:

ID MONTH VALUE

1   1             65

1   2             17

 

.    .               .

1   11           47

1   12           99  

 

2   1             55

2   2             98

 .    .               .

2   11            45

2   12            18  

3

...


NOW provide what the output is supposed to look like for that input.

mikeed
Calcite | Level 5

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

 

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?

PaigeMiller
Diamond | Level 26

@mikeed wrote:

For each ID, I'd would like to add up all the VALUES, from MONTH 1 to MONTH 12, for an annual VALUE.

Then I would like to find summary statistics of annual VALUE for each ID.

 

Perhaps this explains it better and it can't be accomplished in a single procedure, and that's my problem?


I provided code to do this in a single procedure already in the thread. Why do you discuss this as if it there is no such code?

--
Paige Miller
mikeed
Calcite | Level 5

I'm sorry, your code was not detailed enough and did not work since it did not help me

tally the MONTHs that I needed to find the statistics for.

PaigeMiller
Diamond | Level 26

You can't just say "it didn't work". You have to give us details. You have to show us the SASLOG and the data set created, and explain why this is not the proper result.

--
Paige Miller

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 20 replies
  • 1515 views
  • 3 likes
  • 4 in conversation