I've installed the various Sentiment Analysis tools (studio, server and workbench). I've already created my training corpus and created a Statistical Model in studio. I've uploaded the model to the server.
I am now creating a new project in Workbench. There is a tab where I specify my corpus and upload it. The upload fails every time with the error "Unable to upload file".
The file I am uploading is a zipped folder of text files. Here are my guesses as to what may be happening:
1) the file is being uploaded to a folder which I (i.e. the web server or workbench user) may not have permissions to access. But what folder would that be?
2) perhaps the folder is not uploaded, but the contents read and placed into the MySQL database?
3) the file format is incorrect. I also tried zipping only the text documents. That did not work. Perhaps the formats of the files themselves are not acceptable.
I have no clue how to proceed. Any suggestions are appreciated.
I just solved my own question just now. Will it count to mark this as the right answer?
I went into the directory where SAS SA Workbench is installed. There is a "test_documents" folder with an example corpus. It looks like the corpus needs to be a zipped folder of XML files. Each document has the following format:
<doc>
<docid><![CDATA[filename .xml without extension]]></docid>
<title><![CDATA[subject title here]]></title>
<createtime><![CDATA[10/6/2008 10:00:00 AM]]></createtime>
<body><![CDATA[blah blah blah yadda yadda yadda text text text]]></body>
</doc>
What sucks is that the SAS sentiment tools don't appear to build my corpus for me (unless I am missing something?). Instead, I have to joys of converting all of my text files into xml files with this format.
I did manually change 5 of my .txt to .xml with the above xml structure. I was able to upload this successfully.
I just solved my own question just now. Will it count to mark this as the right answer?
I went into the directory where SAS SA Workbench is installed. There is a "test_documents" folder with an example corpus. It looks like the corpus needs to be a zipped folder of XML files. Each document has the following format:
<doc>
<docid><![CDATA[filename .xml without extension]]></docid>
<title><![CDATA[subject title here]]></title>
<createtime><![CDATA[10/6/2008 10:00:00 AM]]></createtime>
<body><![CDATA[blah blah blah yadda yadda yadda text text text]]></body>
</doc>
What sucks is that the SAS sentiment tools don't appear to build my corpus for me (unless I am missing something?). Instead, I have to joys of converting all of my text files into xml files with this format.
I did manually change 5 of my .txt to .xml with the above xml structure. I was able to upload this successfully.
Don't miss out on SAS Innovate - Register now for the FREE Livestream!
Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.
Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.
Find more tutorials on the SAS Users YouTube channel.