<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Extract Person name and Organization name from document using proc textmine. in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768060#M8882</link>
    <description>&lt;P&gt;OK.&lt;/P&gt;
&lt;P&gt;PROC TGPARSE is a procedure that is used by SAS TextMiner in SAS 9.4Mx.&lt;/P&gt;
&lt;P&gt;Normally these Enterprise Miner and Text Miner procedures still function in SAS VIYA 3.x (if you have VDMML and Visual Text Analytics licensed), but apparently this one is not.&lt;/P&gt;
&lt;P&gt;In that case, you are left with the regular expressions as an add-on to PROC TEXTMINE.&lt;/P&gt;
&lt;P&gt;But have you tried the Visual Interface? Maybe with the Visual Interface, you discover some extra bells and whistles.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
    <pubDate>Thu, 16 Sep 2021 08:55:08 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2021-09-16T08:55:08Z</dc:date>
    <item>
      <title>Extract Person name and Organization name from document using proc textmine.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/767858#M8876</link>
      <description>&lt;P&gt;Hi everyone I am using proc textmine to extract names of people and organizations. However not all the names are getting extracted from the document, Is there a way to optimize the proc so that all names are extracted?&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;proc textmine data=mycas.extract;
doc_id id;
var text;
parse
   termwgt    = none
   cellwgt    = none
   reducef    = 4
   entities   = std
   outparent  = mycas.outparent
   outterms   = mycas.outterms
   outchild   = mycas.outchild
   outconfig  = mycas.outconfig
   ;
select "nlpPerson" "nlpOrganization"/group="entities" keep;
run;
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 15 Sep 2021 10:07:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/767858#M8876</guid>
      <dc:creator>kaziumair</dc:creator>
      <dc:date>2021-09-15T10:07:31Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Person name and Organization name from document using proc textmine.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768007#M8878</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am not sure whether you can improve your search for entities with PROC TEXTMINE.&lt;/P&gt;
&lt;P&gt;You may try to add regular expressions to search for specific patterns.&lt;/P&gt;
&lt;P&gt;You can also try to add a PROC TGPARSE (but it's older than PROC TEXTMINE so I do not expect it to be better when searching entities).&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;data cars;
input text $1-70;
datalines;
    The Volkswagen Polo is the World Car of the Year.
    Volkswagen won the award last year.
    Mazda sold the Mazda2 in bright green.
    The Ford Fiesta is sold in lime green.
    The Mazda2 was World Car of the Year in 2008.
;
run;
proc TGPARSE data=cars
    /* turn the entity finder on */
    entities=yes stemming=yes
    tagging=yes key=Key4 out=Out4;
    var text;
run;
/* end of program */&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Cheers,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Wed, 15 Sep 2021 21:04:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768007#M8878</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-09-15T21:04:20Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Person name and Organization name from document using proc textmine.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768009#M8879</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;Also, on top of my previous response (see above), visit the board :&lt;/P&gt;
&lt;P&gt;Analytics &amp;gt; SAS Text and Content Analytics.&lt;/P&gt;
&lt;P&gt;It might give you some ideas (on regular expressions a.o.).&lt;/P&gt;
&lt;P&gt;And next time you have an NLP - textmine question, post it over there!&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Wed, 15 Sep 2021 21:09:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768009#M8879</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-09-15T21:09:49Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Person name and Organization name from document using proc textmine.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768026#M8880</link>
      <description>&lt;P&gt;Hi, when I ran the tgparse code, I got the following error.&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="tgparse_error.PNG" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/63646i0200E4721DC8BA89/image-size/medium?v=v2&amp;amp;px=400" role="button" title="tgparse_error.PNG" alt="tgparse_error.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Sep 2021 05:04:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768026#M8880</guid>
      <dc:creator>kaziumair</dc:creator>
      <dc:date>2021-09-16T05:04:12Z</dc:date>
    </item>
    <item>
      <title>Re: Extract Person name and Organization name from document using proc textmine.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768060#M8882</link>
      <description>&lt;P&gt;OK.&lt;/P&gt;
&lt;P&gt;PROC TGPARSE is a procedure that is used by SAS TextMiner in SAS 9.4Mx.&lt;/P&gt;
&lt;P&gt;Normally these Enterprise Miner and Text Miner procedures still function in SAS VIYA 3.x (if you have VDMML and Visual Text Analytics licensed), but apparently this one is not.&lt;/P&gt;
&lt;P&gt;In that case, you are left with the regular expressions as an add-on to PROC TEXTMINE.&lt;/P&gt;
&lt;P&gt;But have you tried the Visual Interface? Maybe with the Visual Interface, you discover some extra bells and whistles.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck,&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Thu, 16 Sep 2021 08:55:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Extract-Person-name-and-Organization-name-from-document-using/m-p/768060#M8882</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2021-09-16T08:55:08Z</dc:date>
    </item>
  </channel>
</rss>

