I have a query like so:
proc sql;
create table test_query as
select *
from lib.dataset
where something = 1
and (description contains ("ABC")
or description contains ("DEF")
or description contains ("GHI")
or description contains ("JKL")
or description contains ("MNO")
or description contains ("PQR"))
;
quit;
Is there a way to do something like "description LIKE IN ('%ABC%', '%DEF%', '%GHI%', ...)" that would avoid costly ORs within the query? Or is there something more efficient that I'm not thinking of?
How about PRXMATCH()? It takes a Perl regular expression, so the alternatives can be combined with | in a single pattern:
proc print data=sashelp.class;
where prxmatch('/B|M/i',name);
run;
Obs    Name       Sex    Age    Height    Weight
  3    Barbara     F      13      65.3        98
  6    James       M      12      57.3        83
 14    Mary        F      15      66.5       112
 16    Robert      M      12      64.8       128
 18    Thomas      M      11      57.5        85
 19    William     M      15      66.5       112
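Applied to the query in the question, the same idea might look like the sketch below. It assumes the lib.dataset table and the something and description columns from the original post. The /i flag is left out on purpose: CONTAINS is case-sensitive, so a case-sensitive pattern keeps the same behavior.

```sas
proc sql;
  create table test_query as
  select *
  from lib.dataset
  where something = 1
    /* PRXMATCH returns the position of the first match, or 0 if none, */
    /* so any nonzero result is treated as true in the WHERE clause.   */
    and prxmatch('/ABC|DEF|GHI|JKL|MNO|PQR/', description);
quit;
```

If any search term contains a regex metacharacter (such as . or /), it would need to be escaped with a backslash inside the pattern.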
Thanks, @Tom! I can't believe I didn't think of regexes.
Interestingly, the performance didn't seem much better, a sign that maybe the original filters aren't so bad.
But this definitely cleans up the code quite a bit and gives more flexibility for automated query construction.
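As one possible way to automate the query construction, the pattern can be built from a list of terms and passed in through a macro variable. This is only a sketch: the work.keywords table, its keyword column, and the pattern macro variable are hypothetical names, and lib.dataset is the table from the original question.

```sas
/* Hypothetical lookup table holding one search term per row. */
data work.keywords;
  length keyword $20;
  input keyword $;
  datalines;
ABC
DEF
GHI
;
run;

proc sql noprint;
  /* Join the terms with | to form a regex alternation: ABC|DEF|GHI */
  select strip(keyword) into :pattern separated by '|'
  from work.keywords;
quit;

proc sql;
  create table test_query as
  select *
  from lib.dataset
  where something = 1
    /* Double quotes are required so that &pattern resolves. */
    and prxmatch("/&pattern/", description);
quit;
```

Adding or removing a search term then only means editing the keyword table, not the query itself. Terms that contain regex metacharacters would still need escaping before they go into the pattern.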