Text mining and content categorization

High Performance text miner

Reply
Occasional Contributor
Posts: 5

High Performance text miner

/* Process the document collection */
proc hptmine data=documents_dataset;
    doc_id docid; var text;
    parse termwgt=entropy cellwgt=log
run;
svd
stop=sashelp.engstop
outconfig=config
outterms=key;
max_k=50 res=med
svdu=u svdv=v svds=s;

I have a high performance code that lists out SPD's and the code is attached (HPTMINE). I wanted to check if there is a similar kind of High performance proc for generating text clusters and text topics. If we have, can you please help us with a sample code.

 

Thanks a lot in advance!

 

Best regards,

Sharat

Frequent Contributor
Posts: 136

Re: High Performance text miner

Posted in reply to sharat_dwibhasi_okstate_edu
Can you use SAS Enterprise Miner? If so, you can inspect the code generated behind text topic and text cluster nodes to get the code you need.
Ask a Question
Discussion stats
  • 1 reply
  • 399 views
  • 0 likes
  • 2 in conversation