Text mining and content categorization

Running a text mining flow on a EUC-CN operating system returns "ERROR: TGparse internal core error

Reply
Established User
Posts: 1

Running a text mining flow on a EUC-CN operating system returns "ERROR: TGparse internal core error

When running a text mining flow (the file contains Chinese) on a EUC-CN operating system, the Text Parsing node (parsing language is Chinese) returns the following error:

Run time error was encountered. Please see the log in the node Results window for more details. 

ERROR: TGparse internal core error: Unable to open C:\Program Files\SASHome\SASFoundation\9.4\tktg\sasmisc\zh-std.uhtagger.

 

The detailed log info:

NOTE: Begin parsing the document.

MPRINT(TM_PARSE): proc tgparse data=_train key=EMWS1.TextParsing_terms out=EMWS1.TextParsing_tmout config=EMWS1.TextParsing_tmconfig multiterm="\\Mac\Home\Documents\My SAS Files\9.4\Industry\Workspaces\EMWS1\TextParsing\multiword.txt" stemming=no tagging=yes entities=no buildindex=yes indexpath="\\Mac\Home\Documents\My SAS Files\9.4\Industry\Workspaces\EMWS1\TextParsing\" ng=std
MPRINT(TM_PARSE): language=
MPRINT(LOWCASE): chinese
MPRINT(TM_PARSE): outoffset=EMWS1.TextParsing_tmoutpos addsentence addparagraph ; MPRINT(TM_PARSE): var Intro ;
MPRINT(TM_PARSE): ;
MPRINT(TM_PARSE): select "AUX" "CONJ" "DET" "INTERJ" "PART" "PREP" "PRON" "Newline" / drop;
MPRINT(TM_PARSE): select "NUM" "PUNCT" / group="attributes" drop ; MPRINT(TM_PARSE): ;
MPRINT(TM_PARSE): run;

NOTE: Input data set WORK._TRAIN successfully opened.
WARNING: Multiterm words are not supported for the CHINESE language is this release. The multiterm file is ignored.
NOTE: Reading variable number 2 as a body of text.
ERROR: TGparse internal core error: Unable to open C:\Program Files\SASHome\SASFoundation\9.4\tktg\sasmisc\zh-std.uhtagger.

 

How can I fix it? Thanks a lot.

Ask a Question
Discussion stats
  • 0 replies
  • 132 views
  • 0 likes
  • 1 in conversation