<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Extracting street address from a free text document in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877177#M10509</link>
    <description>&lt;P&gt;What's the scope of your use case? How many addresses are you dealing with spread across which countries? How accurate do these addresses have to be? Do you need to accurately locate and geocode them or not? Dataflux / Data Quality has a significant learning curve and effort so is really only worthwhile if you have a lot of addresses that MUST be accurately verified / located / geocoded. It is not worthwhile for low address volumes and where accurate verification and location is not required. A DIY approach with Base SAS tools is more appropriate for these. If you already have experience with Text Mining and SAS DQ then the answers might be different.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 23 May 2023 22:15:48 GMT</pubDate>
    <dc:creator>SASKiwi</dc:creator>
    <dc:date>2023-05-23T22:15:48Z</dc:date>
    <item>
      <title>Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877073#M10506</link>
      <description>&lt;P&gt;Hi Experts,&lt;/P&gt;
&lt;P&gt;It's the first time I am working on such project and I would like some advices.&lt;BR /&gt;The scope of the project is to extract from free-text documents street addresses, if provided in the text.&lt;/P&gt;
&lt;P&gt;I am attempting to extract street addresses from the text documents and I was thinking of using SAS Visual Text Analytics, &lt;BR /&gt;because from the documentation, it should contain the functionality to do the task. &lt;BR /&gt;&lt;BR /&gt;Finally, the text documents to be analyzed are not US ones but in foreign language covered by SAS Visual Text Analytics.&lt;/P&gt;
&lt;P&gt;Do you have experiences in this regard to suggest me a feasible approach?&lt;/P&gt;</description>
      <pubDate>Tue, 23 May 2023 13:35:47 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877073#M10506</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-23T13:35:47Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877080#M10507</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/206213"&gt;@sassy7&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As you shouldn't do this by plain SAS programming (e.g. with the data step)&lt;/P&gt;
&lt;P&gt;, I have moved your post from the "SAS Programming"-board to the "SAS Data Science"-board.&lt;/P&gt;
&lt;P&gt;You need to use SAS Text Miner (SAS 9.4) or SAS Visual Text Analytics (VTA in SAS VIYA) indeed.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Look here :&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;Extracting Information from Text Documents in Business Contexts&lt;BR /&gt;Started ‎08-02-2019 | Modified ‎08-02-2019 | Views 2,931&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Extracting-Information-from-Text-Documents-in-Business-Contexts/ta-p/578740" target="_blank"&gt;https://communities.sas.com/t5/SAS-Communities-Library/Extracting-Information-from-Text-Documents-in-Business-Contexts/ta-p/578740&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Extracting Information from Text Documents in Business Contexts - Part Two&lt;BR /&gt;Started ‎08-09-2019 | Modified ‎08-09-2019 | Views 2,415&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/SAS-Communities-Library/Extracting-Information-from-Text-Documents-in-Business-Contexts/ta-p/580151" target="_blank"&gt;https://communities.sas.com/t5/SAS-Communities-Library/Extracting-Information-from-Text-Documents-in-Business-Contexts/ta-p/580151&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;User’s Guide&lt;BR /&gt;SAS® Visual Text Analytics 8.5: User’s Guide&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/ctxtcdc/8.5/ctxtug/titlepage.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/ctxtcdc/8.5/ctxtug/titlepage.htm&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Explore NLP (predifined categories) and LITI rules (Concepts node) !&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Tue, 23 May 2023 14:24:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877080#M10507</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-05-23T14:24:21Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877114#M10508</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/8525"&gt;@koen&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Many thanks for te info provided!&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;Although street addresses are predefined concepts (nlpPlace), using custom concepts would requires writing the liti/regex syntax: could give me some tips?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also in the SAS Community blogs I have read several posts citing SAS Dataflux/SAS Data Quality as the best tool for address parsing/standardization (where parsing is not Text Analytics parsing).&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am wondering which tool could be the most feasible one or whether both tools are required-- e.g., Visual Text Analytics for concept extraction and SAS Dataflux/SAS Data Quality for address standardization.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Do you have experiences in these aspects to suggest me a feasible approach?&lt;/P&gt;</description>
      <pubDate>Tue, 23 May 2023 16:17:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877114#M10508</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-23T16:17:12Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877177#M10509</link>
      <description>&lt;P&gt;What's the scope of your use case? How many addresses are you dealing with spread across which countries? How accurate do these addresses have to be? Do you need to accurately locate and geocode them or not? Dataflux / Data Quality has a significant learning curve and effort so is really only worthwhile if you have a lot of addresses that MUST be accurately verified / located / geocoded. It is not worthwhile for low address volumes and where accurate verification and location is not required. A DIY approach with Base SAS tools is more appropriate for these. If you already have experience with Text Mining and SAS DQ then the answers might be different.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 23 May 2023 22:15:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877177#M10509</guid>
      <dc:creator>SASKiwi</dc:creator>
      <dc:date>2023-05-23T22:15:48Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877290#M10510</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/8525"&gt;@koen&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;So the choice about which application to use (e.g. SAS Visual Text Analytics and/or SAS Dataflux/SAS Data Quality and/or Base SAS) will depend on the numbers&amp;nbsp; of concepts extracted&amp;nbsp; in the texts.&lt;BR /&gt;One more thing, do you know whether SAS Visual Text Analytics provides a macro similar to %TMFILTER to filter text documents?&lt;/P&gt;</description>
      <pubDate>Wed, 24 May 2023 15:18:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877290#M10510</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-24T15:18:32Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877319#M10511</link>
      <description>&lt;P&gt;Hello &lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/206213"&gt;@sassy7&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;I do not know SAS Data Quality using DataFlux very well, so I cannot comment on that.&lt;/LI&gt;
&lt;LI&gt;In the data step, &lt;BR /&gt;you can try to accomplish your task with all kinds of (parsing) functions and regex.&lt;BR /&gt;Both SAS regular expressions (the RX functions) and Perl regular expressions (the PRX functions) allow you to locate patterns in text strings.&lt;/LI&gt;
&lt;LI&gt;You can definitely use SAS Text Miner (last version 15.3) or SAS Visual Text Analytics (you need SAS VIYA for that and VTA licensed of course).&lt;BR /&gt;The %tmfilter macro&amp;nbsp;&lt;SPAN&gt;can be used to retrieve the documents into a SAS dataset. %tmfilter can also act as a&amp;nbsp;Web crawler macro. %tmfilter is only in SAS Text Miner and NOT in VTA (but similar functionality is there in VTA of course).&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN&gt;Doc %tmfilter macro :&amp;nbsp;&lt;BR /&gt;&lt;A href="https://go.documentation.sas.com/doc/en/tmref/15.3/n1f1hnf1pk8w3in1i2h4v94rty2m.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/tmref/15.3/n1f1hnf1pk8w3in1i2h4v94rty2m.htm&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;SAS® Visual Text Analytics&lt;BR /&gt;&lt;A href="https://support.sas.com/en/software/visual-text-analytics-support.html" target="_blank"&gt;https://support.sas.com/en/software/visual-text-analytics-support.html&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Cheers,&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 24 May 2023 17:17:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877319#M10511</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-05-24T17:17:24Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877446#M10514</link>
      <description>&lt;P&gt;Thanks a lot&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/8525"&gt;@koen&lt;/a&gt;!&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2023 07:01:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877446#M10514</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-25T07:01:20Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877458#M10515</link>
      <description>&lt;P&gt;Also... do you know where I can find documentation about the SAS Text Miner procedures, like the ones available in SAS Visual Text Analytics (proc boolrule, proc textmine and proc tmscore). I found documents about the HP procedures of SAS Text Miner but not about the "standard" procedure used for SAS Text Miner - it would be much easier to use the procedures instead of the UI environment. Or the procedures are the same for the two environments?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2023 08:20:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877458#M10515</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-25T08:20:01Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877470#M10516</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is the official doc :&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="https://support.sas.com/en/software/text-miner-support.html" target="_blank" rel="noopener"&gt;https://support.sas.com/en/software/text-miner-support.html&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/tmhpprcref/titlepage.htm" target="_blank" rel="noopener"&gt;https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.5/tmhpprcref/titlepage.htm&lt;/A&gt;&lt;BR /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;(SAS Text Miner: High-Performance Procedures)&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://support.sas.com/documentation/onlinedoc/txtminer/" target="_blank" rel="noopener"&gt;https://support.sas.com/documentation/onlinedoc/txtminer/&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For the standard SAS Text Miner procedures (NOT the HP-procedures), you need to contact SAS Technical Support in your region / country. They will provide you with the doc such that you will be able to run the procedures without using the UI. I am not allowed to share that doc.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Added note :&lt;BR /&gt;The reason that that documentation for the Text Miner procedures is being withheld is that there is intelligence in the UI. The UI makes sure you can't place certain conflicting options, for example. If you use the procedures directly you don't have that protection and then Technical Support gets all sorts of avoidable questions (questions that are avoided if you use the UI as provided and as designed).&lt;BR /&gt;But OK, if you explicitly ask for that documentation and Technical Support can assess that you "know what you're doing" you're going to get that procedures documentation.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2023 09:21:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877470#M10516</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2023-05-25T09:21:30Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877481#M10517</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/8525"&gt;@koen&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;will do! Thanks a lot for your help!&lt;/P&gt;</description>
      <pubDate>Thu, 25 May 2023 10:02:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/877481#M10517</guid>
      <dc:creator>sassy7</dc:creator>
      <dc:date>2023-05-25T10:02:41Z</dc:date>
    </item>
    <item>
      <title>Re: Extracting street address from a free text document</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/888272#M10560</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As you are aware, the pre-defined concept "nlpPlace" can extract street addresses. If this does not meet your requirements, you can also customize concepts by writing LITI rules.&lt;/P&gt;
&lt;P&gt;For instance, you can start by defining concepts that constitute address components, such as StreetName, StreetType, City, State, Country, and so on. Afterwards, you can combine these concepts following language conventions. For example:&lt;BR /&gt;CONCEPT: StreetName, City, State, Country.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For guidance on writing LITI rules, please refer to the page: &lt;A href="https://go.documentation.sas.com/doc/en/ctxtcdc/v_015/ctxtug/p1kf71w7npr9ecn1gysvovfs42x2.htm" target="_blank"&gt;https://go.documentation.sas.com/doc/en/ctxtcdc/v_015/ctxtug/p1kf71w7npr9ecn1gysvovfs42x2.htm&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hope that helps.&lt;/P&gt;</description>
      <pubDate>Tue, 08 Aug 2023 01:43:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extracting-street-address-from-a-free-text-document/m-p/888272#M10560</guid>
      <dc:creator>Meilan</dc:creator>
      <dc:date>2023-08-08T01:43:32Z</dc:date>
    </item>
  </channel>
</rss>

