I've been a SAS user for the last few years but I'm new to Text Mining. At the outset I would like to let you know that I don't have access to SAS Text Miner/SAS Enterprise Miner. I just have SAS EG with me.
The challenge I'm facing is with regards to Text Search and Count from a "comment" field in a market research survey. My goal is to count the number of occurences of words other than prepositions/articles/conjunctions etc. This gives me an idea about what people are trying to convey using the open comments. I need to do this using SAS Code and not any of the Text Mining Software from SAS.
It would help if someone can point to what is the best way to achieve this. What are the steps I should take? Even if simple pointers are given, I can build the code.
I'm an old Base SAS user and have done some work with the INDEX functions and Macro processing to evaluate text data. Considering EG is your only tool, I would use the "Code Node" and the Base SAS functions & Macros with Data Step programming. Of course, you'll need to find (or build) a database containing the content you are looking for or you can use several other methods. But, I would start with the basics and build from your research. The SAS Online Docs for Base SAS would be very helpful in this regard.
This would be much easier with Text Miner because it can distinguish when terms are being used as prepositions/articles/conjunctions etc. rather than being purely string based. I am sure your entire analysis would benefit from other features of Text Miner as well.