BookmarkSubscribeRSS Feed
☑ This topic is solved. Need further help from the community? Please sign in and ask a new question.
jhh197
Pyrite | Level 9

Hi ,

I have two datasets have and dates . For PMC end date is 30SEP2024 and need to extend this until end date in dates table that is 29 Dec2028 . First check dates in dates table which are greater than 30SEP2024 and add only those dates as an extension to output dataset Test5 which are 

31OCT2024 , 29NOV2024 , 28FEB2025 , 31MAR2027 , 31AUG2028 , 29DEC2028. Same for ERC .  I have used below code but that creates an intermediate dataset want which is having all dates in the year after  30SEP2024 . Is there any other code i can use related to performance . 

 

Can anyone please help . This is an extension to original post 

 

https://communities.sas.com/t5/SAS-Programming/Extend-Dates-to-the-existing-SAS-Dataset/m-p/854924#M...

data have;
input contype$1-3 TimeStamp CP PP MM;
informat TimeStamp date9.;
format TimeStamp date9.;
cards;
PMC 31May2024 200 200 200
PMC 28JUN2024 235 244 33
PMC 31JUL2024 235 500 654
PMC 30SEP2024 238 234 222
ERC 31JAN2023 109 546 230
ERC 28FEB2023 234 320 777
ERC 31MAR2023 223 111 435
ERC 29SEP2023 187 325 120
;
run;

proc sort data=have;
by contype timestamp;
run;

data dates;
input enddate date9.;
informat enddate date9.;
format enddate date9.;
cards;
30JUL2022
31JAN2023
28FEB2023
31MAR2023
31JUL2023
29SEP2023
31JAN2024
31May2024
28JUN2024
31JUL2024
30SEP2024
31OCT2024
29NOV2024
28FEB2025
31MAR2027
31AUG2028
29DEC2028
;
run;

proc sort data=dates;
by enddate;
run;

proc summary data=dates ;
var enddate;
output out=lastday max=enddate;
run;

data want;
if _n_=1 then set lastday(keep=enddate);
set have;
by contype;
output;
if last.contype then do;
cp=0; pp=0; mm=0;
do timestamp=timestamp+1 to enddate;
output;
end;
end;
run;

proc sql;
create table test as
select a.*
from want as a
inner join dates as b on a.TimeStamp= b.enddate;
quit;

Input Datasets 

data have;
input contype$1-3 TimeStamp CP PP MM;
informat TimeStamp date9.;
format TimeStamp date9.;
cards;
PMC 31May2024 200 200 200
PMC 28JUN2024 235 244 33
PMC 31JUL2024 235 500 654
PMC 30SEP2024 238 234 222
ERC 31JAN2023 109 546 230
ERC 28FEB2023 234 320 777
ERC 31MAR2023 223 111 435
ERC 29SEP2023 187 325 120
;
run;

data dates;
input enddate date9.;
informat enddate date9.;
format enddate date9.;
cards;
30JUL2022
31JAN2023
28FEB2023
31MAR2023
31JUL2023
29SEP2023
31JAN2024
31May2024
28JUN2024
31JUL2024
30SEP2024
31OCT2024
29NOV2024
28FEB2025
31MAR2027
31AUG2028
29DEC2028
;
run;

Output like this

Test5

contypeTimeStampCPPPMM
ERC31-Jan-23109546230
ERC28-Feb-23234320777
ERC31-Mar-23223111435
ERC29-Sep-23187325120
ERC31-Jan-24000
ERC31-May-24000
ERC28-Jun-24000
ERC31-Jul-24000
ERC30-Sep-24000
ERC31-Oct-24000
ERC29-Nov-24000
ERC28-Feb-25000
ERC31-Mar-27000
ERC31-Aug-28000
ERC29-Dec-28000
PMC31-May-24200200200
PMC28-Jun-2423524433
PMC31-Jul-24235500654
PMC30-Sep-24238234222
PMC31-Oct-24000
PMC29-Nov-24000
PMC28-Feb-25000
PMC31-Mar-27000
PMC31-Aug-28000
PMC29-Dec-28000
1 ACCEPTED SOLUTION

Accepted Solutions
jhh197
Pyrite | Level 9

hi ,

I understood what the error is i have data in the form of timestamp like seconds so its adding +1 to seconds . Error is cleared . Thank you so much .

View solution in original post

19 REPLIES 19
jhh197
Pyrite | Level 9

when i run the code for huge dataset with more than 300 distinct contype i get below error while creating dataset want 

 

ERROR: Insufficient space in file WANT

ERROR: File WANT is damaged. I/O processing did not complete.

 

Can anyone please help . Thank you 

Kurt_Bremser
Super User

Messages without context are useless. Post the complete log (all code and messages) of the offending step. If previous steps had at least WARNINGs, include them. Post the log by copy/pasting it into a window opened with this button:

Bildschirmfoto 2020-04-07 um 08.32.59.jpg

jhh197
Pyrite | Level 9
In have dataset for Contype PMC enddate is 30SEP2024 . After that we have below dates in Dates table so i add these dates to output dataset and for CP,PP and MM columns it is 0 values . Output should be like Test 5 . How do i achieve output like Test5 ?
31OCT2024
29NOV2024
28FEB2025
31MAR2027
31AUG2028
29DEC2028
andreas_lds
Jade | Level 19

@jhh197 wrote:
In have dataset for Contype PMC enddate is 30SEP2024 . After that we have below dates in Dates table so i add these dates to output dataset and for CP,PP and MM columns it is 0 values . Output should be like Test 5 . How do i achieve output like Test5 ?
31OCT2024
29NOV2024
28FEB2025
31MAR2027
31AUG2028
29DEC2028

Sorry, but i don't understand the relation between this post and the earlier one, which is unreadable due to posting unformatted sas code.

PaigeMiller
Diamond | Level 26

@jhh197 wrote:
In have dataset for Contype PMC enddate is 30SEP2024 . After that we have below dates in Dates table so i add these dates to output dataset and for CP,PP and MM columns it is 0 values . Output should be like Test 5 . How do i achieve output like Test5 ?
31OCT2024
29NOV2024
28FEB2025
31MAR2027
31AUG2028
29DEC2028

You were asked to provide the log, and given instructions to do so; you didn't provide the log. Please provide the log and follow the instructions given.

--
Paige Miller
jhh197
Pyrite | Level 9
Sure will do .
jhh197
Pyrite | Level 9

hi ,

I understood what the error is i have data in the form of timestamp like seconds so its adding +1 to seconds . Error is cleared . Thank you so much .

jhh197
Pyrite | Level 9

33 data have;
34 input contype$1-3 TimeStamp CP PP MM;
35 informat TimeStamp date9.;
36 format TimeStamp date9.;
37 cards;

NOTE: The data set WORK.HAVE has 8 observations and 5 variables.
NOTE: DATA statement used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

46 ;
47 run;
48
49 proc sort data=have;
50 by contype timestamp;
51 run;

NOTE: There were 8 observations read from the data set WORK.HAVE.
NOTE: The data set WORK.HAVE has 8 observations and 5 variables.
NOTE: PROCEDURE SORT used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

53 data dates;
54 input enddate date9.;
55 informat enddate date9.;
56 format enddate date9.;
57 cards;

NOTE: The data set WORK.DATES has 17 observations and 1 variables.
NOTE: DATA statement used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

75 ;
76 run;
77
78 proc sort data=dates;
79 by enddate;
80 run;


NOTE: There were 17 observations read from the data set WORK.DATES.
NOTE: The data set WORK.DATES has 17 observations and 1 variables.
NOTE: PROCEDURE SORT used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

81
82 proc summary data=dates ;
83 var enddate;
84 output out=lastday max=enddate;
85 run;

NOTE: There were 17 observations read from the data set WORK.DATES.
NOTE: The data set WORK.LASTDAY has 1 observations and 3 variables.
NOTE: PROCEDURE SUMMARY used (Total process time):
real time 0.00 seconds
cpu time 0.01 seconds


86
87 data want;
88 if _n_=1 then set lastday(keep=enddate);
89 set have;
90 by contype;
91 output;
92 if last.contype then do;
93 cp=0; pp=0; mm=0;
94 do timestamp=timestamp+1 to enddate;
95 output;
96 end;
97 end;
98 run;

NOTE: There were 1 observations read from the data set WORK.LASTDAY.
NOTE: There were 8 observations read from the data set WORK.HAVE.

NOTE: The data set WORK.WANT has 3477 observations and 6 variables.
NOTE: DATA statement used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

99
100 proc sql;
101 create table test as
102 select a.*
103 from want as a
104 inner join dates as b on a.TimeStamp= b.enddate;
NOTE: Table WORK.TEST created, with 25 rows and 6 columns.

105 quit;
NOTE: PROCEDURE SQL used (Total process time):
real time 0.00 seconds
cpu time 0.00 seconds

 

Here in want I have 3477 observations because in this dataset timestamp+1 is extending to all dates until 29dec2028 like for example but i dont want all the dates in oct want only 31OCT2024 which is in dates table . Is there way to do this ? So it occupies less space . 

contypeTimeStampCPPPMM
ERC30-Sep-24000
ERC1-Oct-24000
ERC2-Oct-24000
ERC3-Oct-24000
ERC4-Oct-24000
ERC5-Oct-24000
ERC6-Oct-24000
ERC7-Oct-24000
ERC8-Oct-24000
ERC9-Oct-24000
ERC10-Oct-24000
ERC11-Oct-24000
ERC12-Oct-24000
ERC13-Oct-24000
ERC14-Oct-24000
ERC15-Oct-24000
ERC16-Oct-24000
ERC17-Oct-24000
ERC18-Oct-24000
ERC19-Oct-24000
ERC20-Oct-24000
ERC21-Oct-24000
ERC22-Oct-24000
ERC23-Oct-24000

 

jhh197
Pyrite | Level 9
Deciding by Max date 29DEC2028
jhh197
Pyrite | Level 9
I have an input dataset have in that for each contype check for max date and for PMC 30sep2024 is max date and i want all records from have and then i want only the records which are greater than 30sep2024 from dates dataset and for those records cp,pp and mm is zero . Output for PMC should be same like how i have in test5

hackathon24-white-horiz.png

The 2025 SAS Hackathon has begun!

It's finally time to hack! Remember to visit the SAS Hacker's Hub regularly for news and updates.

Latest Updates

How to Concatenate Values

Learn how use the CAT functions in SAS to join values from multiple variables into a single value.

Find more tutorials on the SAS Users YouTube channel.

SAS Training: Just a Click Away

 Ready to level-up your skills? Choose your own adventure.

Browse our catalog!

Discussion stats
  • 19 replies
  • 5864 views
  • 0 likes
  • 4 in conversation