art297
Opal | Level 21

I'm trying to run proc tmspell on a file that has almost 2 million entries.  The proc has been running for the past 20 hours and I have no way of knowing which will come first: the proc's completion or a power outage.

Is it possible to break such a run up into sections so that one can take advantage of parallel processing?  I haven't been able to find any documentation for the proc, thus am just assuming it may help in accomplishing a particular task.  Specifically, we are trying to create two crosswalks, one regarding spelling and another basically a list of synonyms.

TIA,

Art

1 ACCEPTED SOLUTION
CraigDeVault
SAS Employee
There is no way to split the procedure's work into multiple sections for parallel or multi-threaded execution. The best practice for running PROC TMSPELL on a large data set is to do some pre-processing first. For example, subset the terms table that PROC TGPARSE outputs so that it contains only the terms with a Keep status of Yes. This should considerably reduce the number of terms passed to PROC TMSPELL and should therefore improve performance substantially.
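
As a rough illustration of that subsetting step, here is a minimal sketch. The data set names (work.terms, work.terms_keep), the column name keep, and its 'Y'/'N' coding are assumptions made for the example, so check them against the terms table your PROC TGPARSE run actually produced. The first DATA step only builds a tiny stand-in table so the sketch runs on its own.

```sas
/* Hypothetical stand-in for the terms table written by PROC TGPARSE. */
/* Replace this step with your real terms table.                      */
data work.terms;
    length term $32 keep $1;
    input term $ keep $;
    datalines;
recieve Y
receive Y
the N
;
run;

/* Keep only the terms flagged as kept (assumed coding: 'Y' = Yes). */
data work.terms_keep;
    set work.terms;
    where upcase(keep) = 'Y';
run;
```

The smaller table (work.terms_keep here) would then be the one passed to PROC TMSPELL in place of the full terms table.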
