BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
gdaymte
Obsidian | Level 7

I have a dataset with a list of facilities and their number of cases by day. I know how calculate cumulative sums over the entire time frame (day 1 to present). I have been asked to calculate a moving 2-week cumulative sum for the last 3 2-week periods. Let's say today is April 28. I would need the cumulative sum between April 15-28 (CumSum1), April 8-21 (CumSum2), and April 1-14 (CumSum3). I have reviewed examples of PROC EXPAND, but I am having a hard time following the syntax. Any help would be greatly appreciated.

 

Have: 

FacName DtCreated CntCase
FACILITY1 04012020 0
FACILITY1 04022020 0
FACILITY1 04032020 0
FACILITY1 04042020 0
FACILITY1 04052020 0
FACILITY1 04062020 0
FACILITY1 04072020 0
FACILITY1 04082020 0
FACILITY1 04092020 0
FACILITY1 04102020 0
FACILITY1 04112020 0
FACILITY1 04122020 0
FACILITY1 04132020 0
FACILITY1 04142020 0
FACILITY1 04152020 0
FACILITY1 04162020 0
FACILITY1 04172020 0
FACILITY1 04182020 0
FACILITY1 04192020 0
FACILITY1 04202020 0
FACILITY1 04212020 0
FACILITY1 04222020 0
FACILITY1 04232020 0
FACILITY1 04242020 1
FACILITY1 04252020 0
FACILITY1 04262020 0
FACILITY1 04272020 0
FACILITY1 04282020 0
FACILITY2 04012020 1
FACILITY2 04022020 0
FACILITY2 04032020 1
FACILITY2 04042020 1
FACILITY2 04052020 1
FACILITY2 04062020 1
FACILITY2 04072020 8
FACILITY2 04082020 31
FACILITY2 04092020 5
FACILITY2 04102020 1
FACILITY2 04112020 0
FACILITY2 04122020 6
FACILITY2 04132020 0
FACILITY2 04142020 3
FACILITY2 04152020 2
FACILITY2 04162020 4
FACILITY2 04172020 0
FACILITY2 04182020 3
FACILITY2 04192020 0
FACILITY2 04202020 3
FACILITY2 04212020 3
FACILITY2 04222020 2
FACILITY2 04232020 1
FACILITY2 04242020 1
FACILITY2 04252020 1
FACILITY2 04262020 0
FACILITY2 04272020 1
FACILITY2 04282020 0

 

Want:

Facility CumSum1 CumSum2 CumSum3
FACILITY1 1 0 0
FACILITY2 21 61 0
1 ACCEPTED SOLUTION

Accepted Solutions
PGStats
Opal | Level 21

If the ending date is fixed across facilities, you could do:

 

*%let td=%sysfunc(date());
/* Or */
%let td=%sysevalf('24apr2020'd);


proc expand data=have out=temp (drop=cntcase where=(dtback in (0,7,14))) ;  
by facname;  
id dtcreated;  
convert cntcase=sum14 / transformin=(movsum 14);
convert dtcreated=dtback / transformout=(neg + &td);
run;

proc transpose data=temp out=want(drop=_name_) prefix=x_;
by facname;
var sum14;
id dtback;
format dtback best.;
run;
PG

View solution in original post

7 REPLIES 7
ChrisNZ
Tourmaline | Level 20

Why not create formats? Since the periods overlap you must remember to use the multilabel option.

mkeintz
PROC Star

PROC EXPAND can easily generate rolling two week sums.  The "trick" is to keep only the most recent two week sum.  If all your facilities have the same date range (i.e. all end on '28apr2020'd), and they are all recorded daily, then this would work:

 

proc expand data=have out=want (drop=cntcase where=(dtcreated>='28apr2020'd)) ;
  by facname;
  id dtcreated;
  convert cntcase=sum14 / transformin=(movsum 14);
run;

If the ending date varies between facilities, then a data step might be better:

data want2 (keep=facname dtcreated sum14);
  set have;
  by facname;
  array days14 {0:13} _temporary_;
  if first.facname then call missing(of days14{*});
  days14{mod(_n_,14)}=cntcase;
  if last.facname;
  if n(of days14{*}) then sum14=sum(of days14{*});
run;

 

--------------------------
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set

Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets

--------------------------
PGStats
Opal | Level 21

If the ending date is fixed across facilities, you could do:

 

*%let td=%sysfunc(date());
/* Or */
%let td=%sysevalf('24apr2020'd);


proc expand data=have out=temp (drop=cntcase where=(dtback in (0,7,14))) ;  
by facname;  
id dtcreated;  
convert cntcase=sum14 / transformin=(movsum 14);
convert dtcreated=dtback / transformout=(neg + &td);
run;

proc transpose data=temp out=want(drop=_name_) prefix=x_;
by facname;
var sum14;
id dtback;
format dtback best.;
run;
PG
gdaymte
Obsidian | Level 7

@PGStats The end date will always be the same date. I have also used PROC TIMESERIES to fill in the missing dates and assign a value of 0.

 

This code works great. What is the word "neg" doing in the transformout segment? Another question, do you know why a sum would come out to be -0? Not all of my zeros are coming out this way, just on occasion.

 

FacName X_14 X_7 X_0
Facility1 0 2 2
Facility2 1 -0 0
Facility3 -0 -0 2

 

Thanks for everyone's help.

PGStats
Opal | Level 21

NEG replaces x with -x . Those minus zeros are certainly small numbers that are rounded to zero for display. This is something that I noticed before, Proc expand is very fast but it doesn't seem to use the same precautions as other SAS procs to prevent rounding error propagation.

PG
gdaymte
Obsidian | Level 7
Ok thanks. This is very helpful.
yabwon
Amethyst | Level 16

Hi @gdaymte,

 

How about conditional aggregation with SQL and little %MACRO? (btw. shouldn't CumSum3 for "Fac2" be 59?)

 

data have;
  input FacName : $ 12. DtCreated : mmddyy8. CntCase;
  format DtCreated mmddyy10.;
cards4;
FACILITY1 04012020 0
FACILITY1 04022020 0
FACILITY1 04032020 0
FACILITY1 04042020 0
FACILITY1 04052020 0
FACILITY1 04062020 0
FACILITY1 04072020 0
FACILITY1 04082020 0
FACILITY1 04092020 0
FACILITY1 04102020 0
FACILITY1 04112020 0
FACILITY1 04122020 0
FACILITY1 04132020 0
FACILITY1 04142020 0
FACILITY1 04152020 0
FACILITY1 04162020 0
FACILITY1 04172020 0
FACILITY1 04182020 0
FACILITY1 04192020 0
FACILITY1 04202020 0
FACILITY1 04212020 0
FACILITY1 04222020 0
FACILITY1 04232020 0
FACILITY1 04242020 1
FACILITY1 04252020 0
FACILITY1 04262020 0
FACILITY1 04272020 0
FACILITY1 04282020 0
FACILITY2 04012020 1
FACILITY2 04022020 0
FACILITY2 04032020 1
FACILITY2 04042020 1
FACILITY2 04052020 1
FACILITY2 04062020 1
FACILITY2 04072020 8
FACILITY2 04082020 31
FACILITY2 04092020 5
FACILITY2 04102020 1
FACILITY2 04112020 0
FACILITY2 04122020 6
FACILITY2 04132020 0
FACILITY2 04142020 3
FACILITY2 04152020 2
FACILITY2 04162020 4
FACILITY2 04172020 0
FACILITY2 04182020 3
FACILITY2 04192020 0
FACILITY2 04202020 3
FACILITY2 04212020 3
FACILITY2 04222020 2
FACILITY2 04232020 1
FACILITY2 04242020 1
FACILITY2 04252020 1
FACILITY2 04262020 0
FACILITY2 04272020 1
FACILITY2 04282020 0
;;;;
run;

options MPRINT;
proc sql;
  select 
    FacName
    %macro intoThePast(periodsNum, start, date, aggr);
     %do i = 1 %to &periodsNum.;
      %local condition;
      %let condition = "%sysfunc(intnx(days, &start.,  7*( 1-&i.)),date9.)"d 
                       >= DtCreated > 
                       "%sysfunc(intnx(days, &start.,  7*(-1-&i.)),date9.)"d; /* make it -2 for 3 week periods, etc. */
      ,sum(case when &condition. then &aggr.
                                 else 0
           end) as CumSum&i. 
     %end;
    %mend intoThePast;
    %intoThePast(3, '28Apr2020'd, DtCreated, CntCase)
  from 
    have
  group by 
    FacName
  ;
quit;

 

All the best

Bart

_______________
Polish SAS Users Group: www.polsug.com and communities.sas.com/polsug

"SAS Packages: the way to share" at SGF2020 Proceedings (the latest version), GitHub Repository, and YouTube Video.
Hands-on-Workshop: "Share your code with SAS Packages"
"My First SAS Package: A How-To" at SGF2021 Proceedings

SAS Ballot Ideas: one: SPF in SAS, two, and three
SAS Documentation



hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 7 replies
  • 2612 views
  • 0 likes
  • 5 in conversation