<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic need help for text analysis through SAS EG. in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522454#M141847</link>
    <description>&lt;P&gt;Hi TEAM,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm currently working on text analysis of review data for specific product from amazon data on &lt;STRONG&gt;Base SAS&lt;/STRONG&gt; (SAS EG).&lt;/P&gt;&lt;P&gt;As you know, there are so many process&amp;nbsp; for text analysis like eliminating stop word, stemming or lemmatization, N-gram and beg of word (document term matrix).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As of now i have done some step ,which is mention below&amp;nbsp; and need some help ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Prepared stop word list and eliminated from row data .&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Is there any way to do tag all word on the basis of part of speech process through SAS EG OR any code from which i can do easily&lt;/EM&gt;?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. STEMMING :-&amp;nbsp; For this process, i got idea and code from google and sas communities, but after using my data on this process i got output data , on which some value is not meaningful for example :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;active&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activ&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adobe&lt;/TD&gt;&lt;TD&gt;adob&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adapted&lt;/TD&gt;&lt;TD&gt;adapt&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adapter&lt;/TD&gt;&lt;TD&gt;adapt&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;aceing&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;ac&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activities&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activ&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;a&lt;FONT color="#FF6600"&gt;ccident&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;accid&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;advertised&lt;/TD&gt;&lt;TD&gt;advertis&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;so i thought the best way to use &lt;STRONG&gt;lemmatization&lt;/STRONG&gt; ,forming all relevant word in to root word (dictionary word), then i can get useful value that is useful for further analysis like if my data having "good" ,"best", "better"and after using lemmatization process i got "GOOD" and if i calculate freq of word then i will get 3 for good value.&lt;/P&gt;&lt;P&gt;so &lt;STRONG&gt;how to write code for this process , any help or idea on this or any code?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;3.&lt;/STRONG&gt; N-GRAM&amp;nbsp;- As per my data, i have used up to TRI-gram (uni, bi and tri gram). after getting output dataset i don't know how to do next process or on which basis will i pick useful observation? from below&amp;nbsp; example :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In below screen shot, there is one more column i have and that is ID (for security purpose i did not mention) and this is for only one ID suppose in below output data for 102 ID VALUE.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;STRONG&gt;GrAM_PROCESS&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;star_rating&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;text&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic productfeedback&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic productfeedback great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback great uni&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;great uni&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SO, &lt;STRONG&gt;how to pick useful observation from above example ?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;ANY HELP IS very much appreciate.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;and if i have posted in wrong community please suggest me with link,so i can post on that link?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;THANKS IN ADVANCE&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Regards,&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;ANU Singh&lt;/STRONG&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 19 Dec 2018 08:40:04 GMT</pubDate>
    <dc:creator>anuranjansngh0</dc:creator>
    <dc:date>2018-12-19T08:40:04Z</dc:date>
    <item>
      <title>need help for text analysis through SAS EG.</title>
      <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522454#M141847</link>
      <description>&lt;P&gt;Hi TEAM,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm currently working on text analysis of review data for specific product from amazon data on &lt;STRONG&gt;Base SAS&lt;/STRONG&gt; (SAS EG).&lt;/P&gt;&lt;P&gt;As you know, there are so many process&amp;nbsp; for text analysis like eliminating stop word, stemming or lemmatization, N-gram and beg of word (document term matrix).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As of now i have done some step ,which is mention below&amp;nbsp; and need some help ?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Prepared stop word list and eliminated from row data .&lt;/P&gt;&lt;P&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Is there any way to do tag all word on the basis of part of speech process through SAS EG OR any code from which i can do easily&lt;/EM&gt;?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. STEMMING :-&amp;nbsp; For this process, i got idea and code from google and sas communities, but after using my data on this process i got output data , on which some value is not meaningful for example :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;active&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activ&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adobe&lt;/TD&gt;&lt;TD&gt;adob&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adapted&lt;/TD&gt;&lt;TD&gt;adapt&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;adapter&lt;/TD&gt;&lt;TD&gt;adapt&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;aceing&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;ac&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activities&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;activ&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;a&lt;FONT color="#FF6600"&gt;ccident&lt;/FONT&gt;&lt;/TD&gt;&lt;TD&gt;&lt;FONT color="#FF6600"&gt;accid&lt;/FONT&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;advertised&lt;/TD&gt;&lt;TD&gt;advertis&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;so i thought the best way to use &lt;STRONG&gt;lemmatization&lt;/STRONG&gt; ,forming all relevant word in to root word (dictionary word), then i can get useful value that is useful for further analysis like if my data having "good" ,"best", "better"and after using lemmatization process i got "GOOD" and if i calculate freq of word then i will get 3 for good value.&lt;/P&gt;&lt;P&gt;so &lt;STRONG&gt;how to write code for this process , any help or idea on this or any code?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;3.&lt;/STRONG&gt; N-GRAM&amp;nbsp;- As per my data, i have used up to TRI-gram (uni, bi and tri gram). after getting output dataset i don't know how to do next process or on which basis will i pick useful observation? from below&amp;nbsp; example :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In below screen shot, there is one more column i have and that is ID (for security purpose i did not mention) and this is for only one ID suppose in below output data for 102 ID VALUE.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;STRONG&gt;GrAM_PROCESS&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;star_rating&lt;/STRONG&gt;&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;text&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic productfeedback&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;basic productfeedback great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;productfeedback great uni&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;great&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;great uni&lt;/TD&gt;&lt;TD&gt;3&lt;/TD&gt;&lt;TD&gt;basic productfeedback great uni&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SO, &lt;STRONG&gt;how to pick useful observation from above example ?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;ANY HELP IS very much appreciate.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;and if i have posted in wrong community please suggest me with link,so i can post on that link?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;THANKS IN ADVANCE&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Regards,&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;ANU Singh&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Dec 2018 08:40:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522454#M141847</guid>
      <dc:creator>anuranjansngh0</dc:creator>
      <dc:date>2018-12-19T08:40:04Z</dc:date>
    </item>
    <item>
      <title>Re: need help for text analysis through SAS EG.</title>
      <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522748#M141970</link>
      <description>&lt;P&gt;I do not understand Q1 and Q3.&lt;/P&gt;
&lt;P&gt;About Q2: What is the&amp;nbsp; process you want&amp;nbsp;help on? trimming the words to their root? are the words in sentences?&lt;/P&gt;</description>
      <pubDate>Wed, 19 Dec 2018 23:25:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522748#M141970</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-12-19T23:25:49Z</dc:date>
    </item>
    <item>
      <title>Re: need help for text analysis through SAS EG.</title>
      <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522800#M141989</link>
      <description>&lt;P&gt;Hi @&lt;A href="https://communities.sas.com/t5/user/viewprofilepage/user-id/16961" target="_self"&gt;&lt;SPAN class="login-bold"&gt;ChrisNZ&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for your reply.&amp;nbsp;&lt;/P&gt;&lt;P&gt;please see the below , in which i have&amp;nbsp;explain&amp;nbsp;more clear of my doubt .&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For question 1 : Need to tag variable values as a part of speech process like Noun, Adj, Adv, Verb. Please see the below screenshot :-&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;beats&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;beautiful&lt;/TD&gt;&lt;TD&gt;Adj&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;beautifully&lt;/TD&gt;&lt;TD&gt;Adv&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;became&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;become&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;become&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;becomes&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;bed&lt;/TD&gt;&lt;TD&gt;Noun&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;bedside&lt;/TD&gt;&lt;TD&gt;Noun&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;been&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;before&lt;/TD&gt;&lt;TD&gt;Adv&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;began&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD&gt;begin&lt;/TD&gt;&lt;TD&gt;Verb&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;so in this basis i can eliminate easily preposition and also some meaningless of word for analysis.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For question 2 :-&lt;/P&gt;&lt;P&gt;As i have mention some value for example&amp;nbsp; (in my first post), i need to convert the variable value to dictionary format for example&lt;/P&gt;&lt;P&gt;some observation is having "GOOD" , "BETTER" "BEST"&amp;nbsp; AND i want to do dictionary format for first form of word as "good" "good" "good". please see the example below :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;HAVING&amp;nbsp;DATA&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; WANT DATA&lt;/P&gt;&lt;P&gt;good&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;good&lt;/P&gt;&lt;P&gt;better&lt;SPAN&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;good&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;best&lt;SPAN&gt;&amp;nbsp;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; good&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;become&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;become&lt;/P&gt;&lt;P&gt;becomes&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;become&lt;/P&gt;&lt;P&gt;became&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;become&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For question 3:-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;After getting outdataset from N-Gram process, what should i do for further process? please see the screen shot on above post.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;PLEASE HELP ME TO GET DESIRE OUTPUT.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Once again thanks a lot.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Anu Singh&lt;/P&gt;</description>
      <pubDate>Thu, 20 Dec 2018 06:19:16 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522800#M141989</guid>
      <dc:creator>anuranjansngh0</dc:creator>
      <dc:date>2018-12-20T06:19:16Z</dc:date>
    </item>
    <item>
      <title>Re: need help for text analysis through SAS EG.</title>
      <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522965#M142033</link>
      <description>&lt;P&gt;Q1.&lt;/P&gt;
&lt;P&gt;If&amp;nbsp; the goal is to eliminate prepositions, you are better off looking for them. A list is&amp;nbsp;&lt;A href="https://www.englishclub.com/grammar/prepositions-list.htm" target="_self"&gt;here&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;If the goal is to tag grammatical usage, this requires a powerful language-parsing algorithm that is well beyond the point (or the capability) of these pages.&lt;/P&gt;
&lt;P&gt;Many words can be a noun and a verb in English (like &lt;EM&gt;beats&lt;/EM&gt;). Some can be verb, noun and adjective (like &lt;EM&gt;swell&lt;/EM&gt;). &lt;EM&gt;fast&lt;/EM&gt; can be all three as well as an adverb.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Q2.&lt;/P&gt;
&lt;P&gt;2.1&amp;nbsp;You need a list of all words (&lt;A href="https://github.com/dwyl/english-words" target="_self"&gt;This list &lt;/A&gt;contains half a million entries and&amp;nbsp;includes plurals and verb forms and superlatives).&lt;/P&gt;
&lt;P&gt;2.2 To that list you need to add a second column containing the root word, so that &lt;EM&gt;went&lt;/EM&gt; can be mapped to &lt;EM&gt;go &lt;/EM&gt;and &lt;EM&gt;best&lt;/EM&gt; to &lt;EM&gt;good&lt;/EM&gt;.&lt;/P&gt;
&lt;P&gt;2.3 Then you need to match to your actual phrases.&lt;/P&gt;
&lt;P&gt;The key&amp;nbsp;for a fast match is to use equijoins. So no looking for a word in a sentence with functions &lt;FONT face="courier new,courier"&gt;index&lt;/FONT&gt;() or &lt;FONT face="courier new,courier"&gt;substr&lt;/FONT&gt;() or operator LIKE. You need to make a table of your text with one word per observation and then match using the = operator.&lt;/P&gt;
&lt;P&gt;2.4 Of course this does not account for spelling errors.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Dec 2018 21:38:23 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/522965#M142033</guid>
      <dc:creator>ChrisNZ</dc:creator>
      <dc:date>2018-12-20T21:38:23Z</dc:date>
    </item>
    <item>
      <title>Re: need help for text analysis through SAS EG.</title>
      <link>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/523362#M142185</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/16961"&gt;@ChrisNZ&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Sorry for late reply.&lt;/P&gt;&lt;P&gt;Thank you for&amp;nbsp;your reply and suggestion.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;for question 1 . As per your suggestion, yes you are right but as per sentence(input data), we can find out part of speech like noun, verb. so is there any way to tag the word from sentence as per part of speech process, so we can easily filter the word as per classification(noun, pronoun, adj, adv) . if you have any idea on this then&amp;nbsp; please suggest me with code it'll be&amp;nbsp; helpful for me to complete my assign task.Please see the NOTE section.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For question 2.&amp;nbsp; This suggestion is quite appreciable even i thought the same but it'll take&amp;nbsp;longer time to&amp;nbsp;create dictionary file (tagging root word).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note :-&amp;nbsp; In the Base SAS, there is some procedure like &lt;STRONG&gt;PROC HPTMINE&lt;/STRONG&gt; (i find out last week), even i used this procedure but from this it comes with up 4 output dataset&amp;nbsp; i.e. outterm (get part of speech, frequency of word) , outchild, outparent, outconfig.&lt;/P&gt;&lt;P&gt;My problem in this procedure, i don't understand the outparent , outchild, outconfig output data set and what will i do further for this dataset.&lt;/P&gt;&lt;P&gt;&amp;nbsp;As per my requirement for time being, i need to calculate frequency of word by Rating(1-5 rating my data) and id (or Observation wise).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example :-&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;input data&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;ID&amp;nbsp; &amp;nbsp; &amp;nbsp;TEXT&lt;/P&gt;&lt;P&gt;&amp;nbsp;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;There is a nice product and good for programmer Thanks flipkart. Nice laptop for study and usage.&lt;/P&gt;&lt;P&gt;&amp;nbsp;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; Very good, comes with windows10 &amp;amp; ms office &amp;amp; student 2016. Thanks.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;WANT :-&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;ID&amp;nbsp; &amp;nbsp;NICE&amp;nbsp; &amp;nbsp;USAGE&amp;nbsp; &amp;nbsp; VERY&amp;nbsp; &amp;nbsp; GOOD&lt;/P&gt;&lt;P&gt;&amp;nbsp;1&amp;nbsp; &amp;nbsp; &amp;nbsp;2&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/P&gt;&lt;P&gt;&amp;nbsp;2&amp;nbsp; &amp;nbsp; &amp;nbsp;0&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It would be really appreciable&amp;nbsp; if you&amp;nbsp;could provide me guidance and code&amp;nbsp;for my issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Wishing you a very merry Christmas&amp;nbsp;in advance &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Regards,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Anu Singh&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Dec 2018 09:07:24 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/need-help-for-text-analysis-through-SAS-EG/m-p/523362#M142185</guid>
      <dc:creator>anuranjansngh0</dc:creator>
      <dc:date>2018-12-24T09:07:24Z</dc:date>
    </item>
  </channel>
</rss>

