<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is there a way to breakup a proc tmspell run into a set of smaller tasks? in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Is-there-a-way-to-breakup-a-proc-tmspell-run-into-a-set-of/m-p/96974#M782</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I'm trying to run proc tmspell on a file that has almost 2 million entries.&amp;nbsp; The proc has been running for the past 20 hours and I have no way of knowing which will come first: the proc's completion or a power outage.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is it possible to break such a run up into sections so that one can take advantage of parallel processing?&amp;nbsp; I haven't been able to find any documentation for the proc, thus am just assuming it may help in accomplishing a particular task.&amp;nbsp; Specifically, we are trying to create two crosswalks, one regarding spelling and another basically a list of synonyms.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TIA,&lt;/P&gt;&lt;P&gt;Art&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 17 Oct 2012 12:33:57 GMT</pubDate>
    <dc:creator>art297</dc:creator>
    <dc:date>2012-10-17T12:33:57Z</dc:date>
    <item>
      <title>Is there a way to breakup a proc tmspell run into a set of smaller tasks?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Is-there-a-way-to-breakup-a-proc-tmspell-run-into-a-set-of/m-p/96974#M782</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I'm trying to run proc tmspell on a file that has almost 2 million entries.&amp;nbsp; The proc has been running for the past 20 hours and I have no way of knowing which will come first: the proc's completion or a power outage.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is it possible to break such a run up into sections so that one can take advantage of parallel processing?&amp;nbsp; I haven't been able to find any documentation for the proc, thus am just assuming it may help in accomplishing a particular task.&amp;nbsp; Specifically, we are trying to create two crosswalks, one regarding spelling and another basically a list of synonyms.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TIA,&lt;/P&gt;&lt;P&gt;Art&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 17 Oct 2012 12:33:57 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Is-there-a-way-to-breakup-a-proc-tmspell-run-into-a-set-of/m-p/96974#M782</guid>
      <dc:creator>art297</dc:creator>
      <dc:date>2012-10-17T12:33:57Z</dc:date>
    </item>
    <item>
      <title>Re: Is there a way to breakup a proc tmspell run into a set of smaller tasks?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Is-there-a-way-to-breakup-a-proc-tmspell-run-into-a-set-of/m-p/388885#M5883</link>
      <description>There is not a way to have the procedure broken up into multiple sections from a parallel or multi-threaded point of view.  The best practice for running PROC TMSPELL on a large data set would be to perform some pre-processing beforehand.  For exampl, in the terms table that is output by PROC TGPARSE, subset the table and only use the terms that have a Keep status of Yes.  This should shrink the number of terms to be run through PROC TMSPELL considerably and will thus increase performance immensely.</description>
      <pubDate>Thu, 17 Aug 2017 17:03:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Is-there-a-way-to-breakup-a-proc-tmspell-run-into-a-set-of/m-p/388885#M5883</guid>
      <dc:creator>CraigDeVault</dc:creator>
      <dc:date>2017-08-17T17:03:18Z</dc:date>
    </item>
  </channel>
</rss>

