I've been tasked to explore whether we can use any SAS tools for a particular text-analytics project. The project involves importing collections of documents of different formats (docx, xlsx, pdf, etc.) into SAS.
SAS Text Miner (part of SAS Enterprise Miner) might be a possible option.
I'm trying to find out all the file formats that the Text Import Node (which is apparently based on the SAS Document Convertor) supports. The training manual says that "More than 100 file formats are supported". I've trawled through the SAS documentation but for the life of me I can't seem to find an exhaustive list of all the file formats supported.
Please could anyone point me towards a list of all the file formats that the Text Import Node (SAS Document Convertor) supports?
Thanks!
SAS does not publish an exhaustive list of all of the file formats supported by the Text Import Node as this list is constantly evolving and subject to change. However, the general list of file formats which are supported by the Text Import Node is as follows:
• .txt
• .rtf
• .csv
• .all
• .asc
• .xlsx
• .docx
• .pptx
• .htm
• .html
• .xml
• Portable Document Format (PDF)
• Microsoft Office 2007/2010/2013
SAS does not publish an exhaustive list of all of the file formats supported by the Text Import Node as this list is constantly evolving and subject to change. However, the general list of file formats which are supported by the Text Import Node is as follows:
• .txt
• .rtf
• .csv
• .all
• .asc
• .xlsx
• .docx
• .pptx
• .htm
• .html
• .xml
• Portable Document Format (PDF)
• Microsoft Office 2007/2010/2013
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.