art297
Opal | Level 21

I'm trying to run proc tmspell on a file that has almost 2 million entries.  The proc has been running for the past 20 hours and I have no way of knowing which will come first: the proc's completion or a power outage.

Is it possible to break such a run up into sections so that one can take advantage of parallel processing?  I haven't been able to find any documentation for the proc, thus am just assuming it may help in accomplishing a particular task.  Specifically, we are trying to create two crosswalks, one regarding spelling and another basically a list of synonyms.

TIA,

Art

1 ACCEPTED SOLUTION

CraigDeVault
SAS Employee
There is no way to break the procedure up into multiple sections for parallel or multi-threaded execution. The best practice for running PROC TMSPELL on a large data set is to do some pre-processing beforehand. For example, subset the terms table that PROC TGPARSE outputs so that it contains only the terms with a Keep status of Yes. This should considerably shrink the number of terms run through PROC TMSPELL and should therefore improve performance substantially.
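As a rough illustration of that pre-processing step, the subset could be done with a DATA step before PROC TMSPELL is called. This is only a sketch under assumptions: `work.terms` and `work.terms_kept` are hypothetical data set names, and the Keep status is assumed to be stored in a character variable named `Keep` coded `'Y'` for Yes. Check the actual variable name and coding in your own PROC TGPARSE output first.

```sas
/* Hypothetical names: work.terms stands in for the terms table          */
/* written by PROC TGPARSE; work.terms_kept is the reduced copy.         */
/* Assumption: Keep status is a character variable named Keep,           */
/* coded 'Y' for Yes. Adjust the WHERE clause if your table differs.     */
data work.terms_kept;
    set work.terms;
    where upcase(keep) = 'Y';
run;

/* Compare row counts before and after to see how much the table shrank. */
proc sql;
    select count(*) as n_all  from work.terms;
    select count(*) as n_kept from work.terms_kept;
quit;
```

The reduced table would then be passed to PROC TMSPELL in place of the full terms table.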

