11-30-2015 03:03 PM
I would like to know a bit about SAS and Hadoop integration.
I know that you can connect SAS to Hadoop via a libname using SAS/ACCESS to Hadoop. As far as I know, in that case you interact with Hadoop using HiveQL: you can insert SAS tables into HDFS through HiveQL, and you can read Hive tables from SAS.
Another way is the Data Loader product. I think it also connects using HiveQL.
You can also use PROC HADOOP to execute HDFS commands.
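To make the libname and PROC HADOOP routes concrete, this is roughly what I have in mind. It is only a sketch: the server name, port, schema, user, table name, paths, and config file are all placeholders, and the name of the configuration option on PROC HADOOP may differ between SAS releases.

```sas
/* Placeholder connection details: replace with your own Hive server settings */
libname hdp hadoop server="hive.example.com" port=10000 schema=default
        user=myuser password="XXXX";

/* Read a Hive table into a SAS data set (some_hive_table is a placeholder) */
data work.sample;
  set hdp.some_hive_table;
run;

/* Write a SAS table to Hive through the same libref */
data hdp.class;
  set sashelp.class;
run;

/* Run HDFS commands with PROC HADOOP.
   Assumption: the cluster configuration file is passed with CFG=;
   check the option name for your release. */
filename cfg "/path/to/hadoop-config.xml";
proc hadoop cfg=cfg username="myuser" password="XXXX" verbose;
  hdfs mkdir="/user/myuser/new_dir";
  hdfs copyfromlocal="/tmp/local_file.txt"
       out="/user/myuser/new_dir/local_file.txt";
run;
```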
I would like to know how you can run analytics procedures in Hadoop, for example the High-Performance Analytics procedures. I have read that these HP procedures can work with Hadoop, but how does it work? Is it also a connector using HiveQL? Can you execute the procedures in the Hadoop cluster using MapReduce? I want to know more about this interaction: whether you only get data from Hadoop, or whether you can take advantage of the Hadoop cluster's performance by executing in a parallel architecture.
Another question: SAS Visual Analytics and Hadoop. I suppose that you can get data from Hadoop (via HiveQL) and load it into the LASR server, is that right? And all the calculations and aggregations are done in LASR, not in the Hadoop cluster, is that right?
Thanks in advance
12-01-2015 02:06 PM
Great that you are curious. But do you have an issue?
If you are interested in starting to use SAS with Hadoop, contacting your local SAS office is a good start.
For an overall map of SAS-Hadoop integration, Paul Kent's Global Forum papers are a good start:
For HPF, it works like in-database processing. For the details of the setup, again, talk to a SAS representative (also because it's surprisingly hard to get this information from the products page).
For Visual Analytics, the standard setup is to load data into HDFS, but in a SAS LASR-specific format, optimized to load quickly into memory. Perhaps you could use Hive (now or in the near future), but it wouldn't be optimal: SAS maps its own data byte-wise to memory on the local server, whereas Hive data is spread around the cluster, so such a 1-1 data mapping isn't possible (I think).
12-04-2015 03:38 PM
To add to LinusH’s response, here are some resources that may help answer your questions.
A couple of technical webinars led by SAS Technical Architect Jeff Bailey:
Two papers with general overviews of how SAS works with Hadoop: