BookmarkSubscribeRSS Feed
heretolearn
Obsidian | Level 7

Hello!

 

I am trying to figure out how to calculate the sum by two variables using proc sql.

 

This is what I have:

id      date              change

1       1/3/15           1

1       1/3/15           0

1       1/3/15           1

1       2/1/15           -1

2       4/1/10           1

2       4/1/10           -1

2       7/10/10         0

 

This is what I need:

id      date              all_change    min_change   max_change

1       1/3/15           2                   0                    1

 

1       2/1/15           -1                 -1                   -1

2       4/1/10           0                  -1                    1

2       7/10/10         0                   0                    0

 

I tried using this:

Proc sql;
Create table change as
Select id
,date
,sum(change) AS all_change
,min(change) AS min_change
,max(change) AS max_change
From dataset
group by
id and date;
Quit;

 

Instead of giving me the sum, min, and max by each id's set of dates, it takes the values for all ids. So I get the same sum of all the change rows in the full dataset for all the ids.

 

Can anyone let me know what I am doing wrong?

1 REPLY 1
Reeza
Super User
Group by ID, Date

No AND


@heretolearn wrote:

Hello!

 

I am trying to figure out how to calculate the sum by two variables using proc sql.

 

This is what I have:

id      date              change

1       1/3/15           1

1       1/3/15           0

1       1/3/15           1

1       2/1/15           -1

2       4/1/10           1

2       4/1/10           -1

2       7/10/10         0

 

This is what I need:

id      date              all_change    min_change   max_change

1       1/3/15           2                   0                    1

 

1       2/1/15           -1                 -1                   -1

2       4/1/10           0                  -1                    1

2       7/10/10         0                   0                    0

 

I tried using this:

Proc sql;
Create table change as
Select id
,date
,sum(change) AS all_change
,min(change) AS min_change
,max(change) AS max_change
From dataset
group by
id and date;
Quit;

 

Instead of giving me the sum, min, and max by each id's set of dates, it takes the values for all ids. So I get the same sum of all the change rows in the full dataset for all the ids.

 

Can anyone let me know what I am doing wrong?


 

hackathon24-white-horiz.png

2025 SAS Hackathon: There is still time!

Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!

Register Now

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 1 reply
  • 893 views
  • 0 likes
  • 2 in conversation