Text mining and content categorization

High Performance text miner

Reply
Occasional Contributor
Posts: 5

High Performance text miner

/* Process the document collection */
proc hptmine data=documents_dataset;
    doc_id docid; var text;
    parse termwgt=entropy cellwgt=log
run;
svd
stop=sashelp.engstop
outconfig=config
outterms=key;
max_k=50 res=med
svdu=u svdv=v svds=s;

I have a high performance code that lists out SPD's and the code is attached (HPTMINE). I wanted to check if there is a similar kind of High performance proc for generating text clusters and text topics. If we have, can you please help us with a sample code.

 

Thanks a lot in advance!

 

Best regards,

Sharat

Frequent Contributor
Posts: 130

Re: High Performance text miner

Can you use SAS Enterprise Miner? If so, you can inspect the code generated behind text topic and text cluster nodes to get the code you need.
Ask a Question
Discussion stats
  • 1 reply
  • 345 views
  • 0 likes
  • 2 in conversation