Hi
I am trying to find a pattern from a sentence e.g I am trying find " I am" and "live" and "with my parents" from "I am 20 years old and I live happily with my parents".
The code I tried is ;
data test;
sentence='I am 20 years old and I live happily with my parents';
pattern=prxparse('/(I am)\w*(live)\w*(with my parents)/i');
b=prxmatch(pattern,sentence);
run:
I am getting 0 value for b
Any help in this regards will be very much appreciated.
Your aren't incorporating the blanks between the words you want to find and all other chars in the text. The following regex seems to solve the issue:
(I am)(\w*\W)*(live)(\w*\W)*(with my parents)
Your aren't incorporating the blanks between the words you want to find and all other chars in the text. The following regex seems to solve the issue:
(I am)(\w*\W)*(live)(\w*\W)*(with my parents)
It seems that the string (\w*\W)* is used to skip any number of words.
Getting result b=1 means that the sentence contains all required substrings
in the defined order.
Next code is searching each sub string, if it exists in the sentence:
data _null_;
sentence='I am 20 years old and I live happily with my parents';
prxid = prxparse('/(I am)|(live)|(my parents)/');
length unit $20;
start=1;
stop=length(sentence);
do while (1);
call prxnext(prxid,start,stop,sentence,pos,len);
if pos=0 then leave;
unit = compress(substr(sentence,pos,len));
put unit=;
end;
run;
Good news: We've extended SAS Hackathon registration until Sept. 12, so you still have time to be part of our biggest event yet – our five-year anniversary!
Learn how use the CAT functions in SAS to join values from multiple variables into a single value.
Find more tutorials on the SAS Users YouTube channel.
Ready to level-up your skills? Choose your own adventure.