<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: REGEX rule in VTA 8.5 in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934363#M10870</link>
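    <description>data _null_;&lt;BR /&gt;/* build a value with CR+LF (hex 0D0A) between the two words */&lt;BR /&gt;a='aa'||'0D0A'x||'bb';&lt;BR /&gt;/* the hex dump should show 0D0A between 6161 and 6262 */&lt;BR /&gt;put a= $hex32.;&lt;BR /&gt;&lt;BR /&gt;a='aa'||'  '||'bb';&lt;BR /&gt;put a= $hex32.;&lt;BR /&gt;&lt;BR /&gt;run;</description>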
    <pubDate>Tue, 02 Jul 2024 06:40:45 GMT</pubDate>
    <dc:creator>Ksharp</dc:creator>
    <dc:date>2024-07-02T06:40:45Z</dc:date>
    <item>
      <title>REGEX rule in VTA 8.5</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934226#M10862</link>
      <description>&lt;P&gt;Hi all, I'm new to SAS and would like to extract entities from text. The predefined concept is inaccurate, so I'm trying to use custom regex rules.&lt;/P&gt;&lt;P&gt;The sample text looks like this:&lt;/P&gt;&lt;P&gt;=====================&lt;/P&gt;&lt;P&gt;Email header&lt;BR /&gt;Name&lt;BR /&gt;Jon Doe&lt;BR /&gt;Designation&lt;BR /&gt;Super Spy&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Email header2&lt;BR /&gt;Full name&amp;nbsp;&lt;BR /&gt;Sam Smith Junior&lt;BR /&gt;Designation&lt;BR /&gt;Super Spy2&lt;/P&gt;&lt;P&gt;=====================&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So I defined a rule: &lt;STRONG&gt;REGEX:\n(?:Name|Full name)\s?\n[^\n]+\n&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;I intend to match based on:&lt;/P&gt;&lt;P&gt;1) Start with a newline character \n&lt;/P&gt;&lt;P&gt;2) A non-capturing group&amp;nbsp;(?:Name|Full name). [However, I noticed that "Name" or "Full name" is still being captured regardless.]&lt;/P&gt;&lt;P&gt;3) An optional space character after&amp;nbsp;Name or Full name&lt;/P&gt;&lt;P&gt;4) Match a newline character \n&lt;/P&gt;&lt;P&gt;5) Capture the full name using [^\n]+&lt;/P&gt;&lt;P&gt;6) End with a newline character&amp;nbsp;\n&lt;/P&gt;&lt;P&gt;This REGEX rule returns 0 matches; the issue seems to come from the newline character match.&lt;/P&gt;&lt;P&gt;Kindly advise, please!&lt;/P&gt;</description>
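For anyone who wants to try the six steps above outside VTA, here is a minimal sketch using Python's re module. It is not the VTA regex engine, so behavior there may differ. The CRLF line endings in SAMPLE, the TOLERANT pattern, and the extract_names helper are all assumptions made for illustration. It shows two things: a non-capturing group still belongs to the overall match (it only avoids creating a numbered group), and a capturing group around [^\r\n]+ isolates just the name.

```python
import re

# Sample text modelled on the post; the CRLF line endings are an assumption.
SAMPLE = (
    "Email header\r\nName\r\nJon Doe\r\nDesignation\r\nSuper Spy\r\n"
    "\r\n"
    "Email header2\r\nFull name \r\nSam Smith Junior\r\nDesignation\r\nSuper Spy2\r\n"
)

# The rule from the post, written as a Python pattern.
ORIGINAL = re.compile(r"\n(?:Name|Full name)\s?\n[^\n]+\n")

# Hypothetical variant: \r?\n accepts both LF and CRLF line endings,
# and group 1 holds only the line that follows the label.
TOLERANT = re.compile(r"\r?\n(?:Name|Full name) ?\r?\n([^\r\n]+)\r?\n")


def extract_names(text: str) -> list[str]:
    """Return the line that follows each 'Name' or 'Full name' label."""
    return [m.group(1) for m in TOLERANT.finditer(text)]


# With LF-only text the original rule matches, but the whole match
# includes the label, because (?:...) is still part of the match.
lf_text = SAMPLE.replace("\r\n", "\n")
print(ORIGINAL.findall(lf_text))

# The tolerant variant returns only the names, for either line ending.
print(extract_names(SAMPLE))
print(extract_names(lf_text))
```

If the tool only reports the whole match rather than a group, the label will be part of the result no matter how the group is written, which would explain observation 2) in the list above.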
      <pubDate>Mon, 01 Jul 2024 08:52:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934226#M10862</guid>
      <dc:creator>SASvtaUser</dc:creator>
      <dc:date>2024-07-01T08:52:02Z</dc:date>
    </item>
    <item>
      <title>Re: REGEX rule in VTA 8.5</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934227#M10863</link>
      <description>&lt;P&gt;Allow me to add some more context, please! The reason I'm trying regex instead of other means is that my source data is very unstructured, and I'm trying to use the REGEX rule type to capture very specific scenarios.&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jul 2024 08:55:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934227#M10863</guid>
      <dc:creator>SASvtaUser</dc:creator>
      <dc:date>2024-07-01T08:55:53Z</dc:date>
    </item>
    <item>
      <title>Re: REGEX rule in VTA 8.5</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934354#M10868</link>
      <description>&lt;P&gt;You could use &lt;BR /&gt;\x0D\x0A&lt;BR /&gt;or&lt;BR /&gt;\x0A&lt;BR /&gt;to replace the newline \n.&lt;BR /&gt;Try using the $hex. format to check whether your newline character is 0A or 0D0A.&lt;/P&gt;</description>
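As a rough illustration of the check suggested above, here is a small Python sketch (not SAS and not VTA) that hex-dumps a string and classifies its line endings. The function names hex_dump and newline_style are hypothetical helpers, and the sample strings are made up.

```python
def hex_dump(text: str) -> str:
    """Upper-case hex of the UTF-8 bytes, similar in spirit to a $hex. dump."""
    return text.encode("utf-8").hex().upper()


def newline_style(text: str) -> str:
    """Classify line endings in text as 'CRLF', 'LF', 'mixed', or 'none'."""
    crlf = text.count("\r\n")
    lf_only = text.count("\n") - crlf  # LF characters not preceded by CR
    if crlf and lf_only:
        return "mixed"
    if crlf:
        return "CRLF"
    if lf_only:
        return "LF"
    return "none"


print(hex_dump("aa\r\nbb"))       # 0D0A appears between 6161 and 6262
print(newline_style("aa\r\nbb"))  # CRLF
print(newline_style("aa\nbb"))    # LF
```

If the dump shows 0D0A, a pattern that expects a bare \n right after the label may fail, and \x0D\x0A (or an optional \x0D before \x0A) is the safer thing to match.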
      <pubDate>Tue, 02 Jul 2024 02:11:52 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934354#M10868</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2024-07-02T02:11:52Z</dc:date>
    </item>
    <item>
      <title>Re: REGEX rule in VTA 8.5</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934361#M10869</link>
      <description>&lt;P&gt;Could you elaborate on how to check for&amp;nbsp;&lt;SPAN&gt;0A or 0D0A (using&amp;nbsp;$hex.?) in VTA, please?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 02 Jul 2024 06:15:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934361#M10869</guid>
      <dc:creator>SASvtaUser</dc:creator>
      <dc:date>2024-07-02T06:15:16Z</dc:date>
    </item>
    <item>
      <title>Re: REGEX rule in VTA 8.5</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934363#M10870</link>
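      <description>data _null_;&lt;BR /&gt;/* build a value with CR+LF (hex 0D0A) between the two words */&lt;BR /&gt;a='aa'||'0D0A'x||'bb';&lt;BR /&gt;/* the hex dump should show 0D0A between 6161 and 6262 */&lt;BR /&gt;put a= $hex32.;&lt;BR /&gt;&lt;BR /&gt;a='aa'||'  '||'bb';&lt;BR /&gt;put a= $hex32.;&lt;BR /&gt;&lt;BR /&gt;run;</description>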
      <pubDate>Tue, 02 Jul 2024 06:40:45 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/REGEX-rule-in-VTA-8-5/m-p/934363#M10870</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2024-07-02T06:40:45Z</dc:date>
    </item>
  </channel>
</rss>

