BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
art297
Opal | Level 21

I'm trying to run proc tmspell on a file that has almost 2 million entries.  The proc has been running for the past 20 hours and I have no way of knowing which will come first: the proc's completion or a power outage.

Is it possible to break such a run up into sections so that one can take advantage of parallel processing?  I haven't been able to find any documentation for the proc, thus am just assuming it may help in accomplishing a particular task.  Specifically, we are trying to create two crosswalks, one regarding spelling and another basically a list of synonyms.

TIA,

Art

1 ACCEPTED SOLUTION

Accepted Solutions
CraigDeVault
SAS Employee
There is not a way to have the procedure broken up into multiple sections from a parallel or multi-threaded point of view. The best practice for running PROC TMSPELL on a large data set would be to perform some pre-processing beforehand. For exampl, in the terms table that is output by PROC TGPARSE, subset the table and only use the terms that have a Keep status of Yes. This should shrink the number of terms to be run through PROC TMSPELL considerably and will thus increase performance immensely.

View solution in original post

1 REPLY 1
CraigDeVault
SAS Employee
There is not a way to have the procedure broken up into multiple sections from a parallel or multi-threaded point of view. The best practice for running PROC TMSPELL on a large data set would be to perform some pre-processing beforehand. For exampl, in the terms table that is output by PROC TGPARSE, subset the table and only use the terms that have a Keep status of Yes. This should shrink the number of terms to be run through PROC TMSPELL considerably and will thus increase performance immensely.

sas-innovate-2024.png

Available on demand!

Missed SAS Innovate Las Vegas? Watch all the action for free! View the keynotes, general sessions and 22 breakouts on demand.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 1 reply
  • 956 views
  • 1 like
  • 2 in conversation