BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
PatrykSAS
Obsidian | Level 7

Hello,

I have data with the duration of the contract. However, some data is unstructured and date contracts overlap. In these cases, I would like the term of the contract to be in line with other contracts. Below is an example of what the data looks like:

ID START_DATE STOP_DATE
1 2019-01-01 2019-12-31
1 2020-01-01 2020-12-31
2 2020-01-01 2020-10-31
2 2020-01-07 2021-12-31
2 2020-09-01 2022-12-31

ID 1 is shown as valid. I would like to transform ID 2 into the following form:

ID START_DATE STOP_DATE
1 2019-01-01 2019-12-31
1 2020-01-01 2020-12-31
2 2020-01-01 2020-01-06
2 2020-01-07 2020-08-31
2 2020-09-01 2022-12-31

I know the basics of SAS and I don't know how to solve this task. Can You help me?

1 ACCEPTED SOLUTION

Accepted Solutions
Tom
Super User Tom
Super User

Here is a method for looking ahead to see the value of START_DATE on the next observation.

data want ;
  set have ;
  by id start_date stop_date ;
  set have(keep=start_date rename=(start_date=next_date) firstobs=2) 
      have(obs=1 drop=_all_)
  ;
  if last.id then next_date=.;
  new_stop = min(stop_date,next_date-1);
  format new_stop yymmdd10.;
run;
Obs    ID    START_DATE     STOP_DATE     next_date      new_stop

 1      1    2019-01-01    2019-12-31    2020-01-01    2019-12-31
 2      1    2020-01-01    2020-12-31             .    2020-12-31
 3      2    2020-01-01    2020-10-31    2020-01-07    2020-01-06
 4      2    2020-01-07    2021-12-31    2020-09-01    2020-08-31
 5      2    2020-09-01    2022-12-31             .    2022-12-31

View solution in original post

6 REPLIES 6
Sajid01
Meteorite | Level 14

Can you please explain your business logic in more detail?

 

PatrykSAS
Obsidian | Level 7
The client with the id 2 has signed 3 annexes. The first one signed until 2020.10, another one by 2021.12 and another by 2022.12. Each annex has different contract terms. Now, for example, I would like to calculate this client's margin for 2020. If I used table number 1, I would count his margins three times, which is not true. The new annex should shorten the period of validity of the previous one. In this way, I will count several months of 2020 according to one contract, and the remaining months according to the terms of another annex.
mkeintz
PROC Star

You want to modify the current stop_date if the following start_date overlaps:

data want (drop=nxt_:) ;
  set have (keep=id);
  by id;
  merge have  have (firstobs=2 keep=start_date rename=(start_date=nxt_start));
  if last.id=0 then stop_date=min(stop_date,nxt_start-1);
run;
--------------------------
The hash OUTPUT method will overwrite a SAS data set, but not append. That can be costly. Consider voting for Add a HASH object method which would append a hash object to an existing SAS data set

Would enabling PROC SORT to simultaneously output multiple datasets be useful? Then vote for
Allow PROC SORT to output multiple datasets

--------------------------
Tom
Super User Tom
Super User

Here is a method for looking ahead to see the value of START_DATE on the next observation.

data want ;
  set have ;
  by id start_date stop_date ;
  set have(keep=start_date rename=(start_date=next_date) firstobs=2) 
      have(obs=1 drop=_all_)
  ;
  if last.id then next_date=.;
  new_stop = min(stop_date,next_date-1);
  format new_stop yymmdd10.;
run;
Obs    ID    START_DATE     STOP_DATE     next_date      new_stop

 1      1    2019-01-01    2019-12-31    2020-01-01    2019-12-31
 2      1    2020-01-01    2020-12-31             .    2020-12-31
 3      2    2020-01-01    2020-10-31    2020-01-07    2020-01-06
 4      2    2020-01-07    2021-12-31    2020-09-01    2020-08-31
 5      2    2020-09-01    2022-12-31             .    2022-12-31

PatrykSAS
Obsidian | Level 7
It works! Thank you a lot 🙂
Shmuel
Garnet | Level 18

Next code is tested:

data have;
  infile cards dlm = '09'x truncover;
  input ID Start_date yymmdd10. @+1 Stop_date yymmdd10.;
  format start_date stop_date yymmdd10.;
cards;
1	2019-01-01	2019-12-31
1	2020-01-01	2020-12-31
2	2020-01-01	2020-10-31
2	2020-01-07	2021-12-31
2	2020-09-01	2022-12-31
; run;

proc sort data=have; 
    by ID descending start_date descending stop_date; 
run;
data want;
 set have;
   by ID;
        prev_start = lag(start_date);
        if not first.ID and
           stop_date > prev_start
        then  stop_date = prev_start -1;
     drop prev_start;
run; 
proc sort data=want; 
    by ID start_date stop_date; 
run;

 

 

 

 

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 6 replies
  • 1479 views
  • 6 likes
  • 5 in conversation