Hi All,
When run the following code for imputing the date and time, I am getting the following errors. Can anyone help me what is wrong with this? Also, I do not understand why prxid variable value is showing different values whenever I execute the call statement %splitIso8601dtc( inDat= ae, outDat= ksfe_2018, var= aestdtc ).
Can anyone help me to understand what is wrong with the code and how correct it?
Errors are:
ERROR: Invalid regular expression delimiter "p" in "prxid". Delimiter must be non-alphanumeric
and non-whitespace.
ERROR: The regular expression passed to the function PRXMATCH contains a syntax error.
WARNING: Argument 1 to function PRXMATCH referenced by the %SYSFUNC or %QSYSFUNC macro function
is out of range.
NOTE: Mathematical operations could not be performed during %SYSFUNC function execution. The
result of the operations have been set to a missing value.
ERROR: A character operand was found in the %EVAL function or %IF condition where a numeric
operand is required. The condition was: %sysfunc(prxmatch(prxid,&var))
ERROR: The macro SPLITISO8601DTC will stop executing.
DO NOT use MACRO code where you need SAS code.
%macro splitIso8601dtc
(inDat= /* Input dataset name */
,outDat= /* Output dataset name */
,var= /* Input characer variable name */
) / DES='Macro to separate ISO 8601 datetime components';
%local pattern ;
%let pattern = /^(\d{4})(-(\d{2}|-)(-(\d{2}|-)(T(\d{2}|-)(:(\d{2}|-)(:(\d{2}))?)?)?)?)?\s*$/;
%put NOTE: &sysmacroname: &=indat &=outdat &=var &=pattern;
* create new dataset from existing dataset ;
DATA &outDat;
SET &inDat;
* create regular expression ID to separate is8601dt datetime components;
if _n_=1 then prxid= prxparse("&pattern");
retain prxid ;
* separate datetime components;
if prxmatch(prxid,&var) then do;
&var._year= input(prxposn(prxid,1,&var), ??4.);
&var._min_month= input(prxposn(prxid,3,&var), ??2.);
&var._min_day= input(prxposn(prxid,5,&var), ??2.);
&var._min_hour= input(prxposn(prxid,7,&var), ??2.);
&var._min_minute= input(prxposn(prxid,9,&var), ??2.);
&var._min_second= input(prxposn(prxid,11,&var), ??2.);
* impute missing datetime components;
* impute month when missing;
if missing(&var._min_month) then do;
&var._min_month= 1;
&var._max_month= 12;
end;
else do;
&var._max_month= &var._min_month;
end;
* impute day when missing;
if missing(&var._min_day) then do;
&var._min_day= 1;
* Last day of the respective month and year;
&var._max_day=day(intnx('month',mdy(&var._max_month,1,&var._year),0,'E'));
end;
else do;
&var._max_day= &var._min_day;
end;
* impute hour when missing;
if missing(&var._min_hour) then do;
&var._min_hour= 0;
&var._max_hour= 23;
end;
else do;
&var._max_hour= &var._min_hour;
end;
* impute minute when missing;
if missing(&var._min_minute) then do;
&var._min_minute= 0;
&var._max_minute= 59;
end;
else do;
&var._max_minute= &var._min_minute;
end;
* impute second when missing;
IF missing(&var._min_second) then do;
&var._min_second= 0;
&var._max_second= 59;
end;
else do;
&var._max_second= &var._min_second;
end;
* create the full date/time value;
&var._min= dhms(mdy(&var._min_month, &var._min_day, &var._year)
,&var._min_hour, &var._min_minute, &var._min_second)
;
&var._max= dhms(datepart(&var._min)
,&var._max_hour, &var._max_minute, &var._max_second)
;
end;
FORMAT &var._min &var._max is8601dt.;
run;
%MEND splitIso8601dtc;
* Sample data ;
data ae;
length aestdtc $20;
input aestdtc ;
datalines;
2017-03
2017-03-01T14
2017---15
2017-03--T07:30
2017----T07:30
;
* call the macro ;
options mprint;
%splitIso8601dtc( inDat= ae, outDat= ksfe_2018, var= aestdtc )
proc print;
run;
Too many pieces to this that I don't understand. But the one I do understand is this.
Macro language %IF %THEN statements do not execute for every observation in a SAS data set. They merely generate DATA step code. So this statement can't work as intended:
%IF missing(&var._min_day) %THEN %DO;
Even adding %SYSFUNC around the MISSING function won't help. You would need to convert to DATA step IF THEN statements instead.
For debugging can eliminate the part of the macro after the PRXMATCH command, let's say eliminate code after the * impute missing datetime components; comment.
Then since you have errors in the log, show us the ENTIRE log for this abbreviated macro. Do not shows us only the errors. Do not pick and choose parts of the log to show us.
Since you have errors in a macro, you need run the macro debugging command
options mprint;
and then re-run the macro and show us the ENTIRE log. Please copy and paste the log into the window that appears when you click on the </> icon.
Hi PaigeMiller, I have edited my earlier reply with the mprint option as after re-opening the SAS, I missed to run the code to create the dataset. Now, I have run it and here are the errors in the log with mprint:
options mprint;
96
97 %splitIso8601dtc( inDat= ae, outDat= ksfe_2018, var= aestdtc )
MPRINT(SPLITISO8601DTC): DATA ksfe_2018;
MPRINT(SPLITISO8601DTC): SET ae;
MPRINT(SPLITISO8601DTC): * create regular expression ID to separate is8601dt datetime
components;
var is.............aestdtc
prxid is ...........1
ERROR: Invalid regular expression delimiter "p" in "prxid". Delimiter must be non-alphanumeric
and non-whitespace.
ERROR: The regular expression passed to the function PRXMATCH contains a syntax error.
WARNING: Argument 1 to function PRXMATCH referenced by the %SYSFUNC or %QSYSFUNC macro function
is out of range.
NOTE: Mathematical operations could not be performed during %SYSFUNC function execution. The
result of the operations have been set to a missing value.
ERROR: A character operand was found in the %EVAL function or %IF condition where a numeric
operand is required. The condition was: %sysfunc(prxmatch(prxid,&var))
ERROR: The macro SPLITISO8601DTC will stop executing.
You are trying to use macro code to work with data. That cannot work. You use the macro processor to generate SAS code. It is the job of the SAS code the macro generates to work with the data.
So this statement should not be using macro code:
%let prxid= %sysfunc(prxparse('/^(\d{4})(-(\d{2}|-)(-(\d{2}|-)(T(\d{2}|-)(:(\d{2}|-)(:(\d{2}))?)?)?)?)?\s*$/'));
And the reason it is getting an error is because you included quotes where they are not needed. So those because part of the string you passed to PRXPARSE(). So it sees the single quote character as the first character so it uses that as the separator instead of the normal / character.
Step 1: get rid of all the macro code and create a DATA step which does it for a single variable in a single dataset.
Since you only want the dataset and variable names to be dynamic, only those need to be replaced by macro parameters. There is absolutely NO need to use %IF anywhere.
DO NOT use MACRO code where you need SAS code.
%macro splitIso8601dtc
(inDat= /* Input dataset name */
,outDat= /* Output dataset name */
,var= /* Input characer variable name */
) / DES='Macro to separate ISO 8601 datetime components';
%local pattern ;
%let pattern = /^(\d{4})(-(\d{2}|-)(-(\d{2}|-)(T(\d{2}|-)(:(\d{2}|-)(:(\d{2}))?)?)?)?)?\s*$/;
%put NOTE: &sysmacroname: &=indat &=outdat &=var &=pattern;
* create new dataset from existing dataset ;
DATA &outDat;
SET &inDat;
* create regular expression ID to separate is8601dt datetime components;
if _n_=1 then prxid= prxparse("&pattern");
retain prxid ;
* separate datetime components;
if prxmatch(prxid,&var) then do;
&var._year= input(prxposn(prxid,1,&var), ??4.);
&var._min_month= input(prxposn(prxid,3,&var), ??2.);
&var._min_day= input(prxposn(prxid,5,&var), ??2.);
&var._min_hour= input(prxposn(prxid,7,&var), ??2.);
&var._min_minute= input(prxposn(prxid,9,&var), ??2.);
&var._min_second= input(prxposn(prxid,11,&var), ??2.);
* impute missing datetime components;
* impute month when missing;
if missing(&var._min_month) then do;
&var._min_month= 1;
&var._max_month= 12;
end;
else do;
&var._max_month= &var._min_month;
end;
* impute day when missing;
if missing(&var._min_day) then do;
&var._min_day= 1;
* Last day of the respective month and year;
&var._max_day=day(intnx('month',mdy(&var._max_month,1,&var._year),0,'E'));
end;
else do;
&var._max_day= &var._min_day;
end;
* impute hour when missing;
if missing(&var._min_hour) then do;
&var._min_hour= 0;
&var._max_hour= 23;
end;
else do;
&var._max_hour= &var._min_hour;
end;
* impute minute when missing;
if missing(&var._min_minute) then do;
&var._min_minute= 0;
&var._max_minute= 59;
end;
else do;
&var._max_minute= &var._min_minute;
end;
* impute second when missing;
IF missing(&var._min_second) then do;
&var._min_second= 0;
&var._max_second= 59;
end;
else do;
&var._max_second= &var._min_second;
end;
* create the full date/time value;
&var._min= dhms(mdy(&var._min_month, &var._min_day, &var._year)
,&var._min_hour, &var._min_minute, &var._min_second)
;
&var._max= dhms(datepart(&var._min)
,&var._max_hour, &var._max_minute, &var._max_second)
;
end;
FORMAT &var._min &var._max is8601dt.;
run;
%MEND splitIso8601dtc;
* Sample data ;
data ae;
length aestdtc $20;
input aestdtc ;
datalines;
2017-03
2017-03-01T14
2017---15
2017-03--T07:30
2017----T07:30
;
* call the macro ;
options mprint;
%splitIso8601dtc( inDat= ae, outDat= ksfe_2018, var= aestdtc )
proc print;
run;
Thank you very much Tom. It's working now. It helped me a lot. Now, I understand what went wrong with the code. Once again, thank you very much.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 16. Read more here about why you should contribute and what is in it for you!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.