02-04-2015 09:42 AM
If we have an existing SAS 9.4 installation up and running:
What is the minimum requirement to import and export data between SAS and Hadoop? Is that possible without purchasing any additional SAS components, or must I have SAS/ACCESS Interface to Hadoop?
Do I need one of the Hadoop distributions, like Cloudera, or can I run plain Apache Hadoop on commodity hardware and connect to it from SAS?
This is not for a production environment; it is just for testing and for understanding how SAS and Hadoop work together. I know there are many considerations when planning for Hadoop, but I am asking about the simplest possible connection and interaction.
Also, is this enough to run the Hadoop transformations in DI Studio (DIS)?
I am not looking for documentation right now, just short comments and advice from experience. I will go into the details later.
02-04-2015 10:06 AM
Too lazy are we?
Short answer: file-level access to HDFS through the FILENAME statement and the SPD Engine (SPDE) on HDFS requires only Base SAS.
For database queries against Hive, you need a SAS/ACCESS license.
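To make the licensing split concrete, here is a minimal sketch of the two kinds of LIBNAME statement. Every path, host name, user ID, and table name is a hypothetical placeholder, and the exact options available depend on your SAS 9.4 maintenance release, so treat it as an illustration rather than a tested setup.

```sas
/* Sketch only: paths, host name, user ID, and table name are placeholders. */

/* Both engines need the Hadoop client JARs and the cluster configuration
   files copied to the SAS machine; these variables point at those copies. */
options set=SAS_HADOOP_JAR_PATH="/opt/sas/hadoop/jars";
options set=SAS_HADOOP_CONFIG_PATH="/opt/sas/hadoop/conf";

/* Base SAS only: an SPD Engine library stored in an HDFS directory. */
libname spdhdfs spde '/user/hduser/spde' hdfshost=default;

data spdhdfs.class;
   set sashelp.class;
run;

/* Needs a SAS/ACCESS Interface to Hadoop license: LIBNAME against Hive. */
libname hive hadoop server='hive.example.com' port=10000
        schema=default user=hduser;

proc sql;
   select count(*) from hive.some_table;   /* hypothetical Hive table */
quit;
```

Without the SAS/ACCESS license, the second LIBNAME statement fails because the HADOOP engine is not available. The SPDE library still works with Base SAS alone.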
02-05-2015 04:58 AM
Thanks a lot.
What about the supported Hadoop distributions? Must it be one of the commercial ones (Cloudera, Hortonworks, MapR, IBM, Pivotal), or will plain Hadoop running on Linux (a single-node cluster) work too? Are there any other restrictions?
With that setup, will the Hadoop transformations in DIS work, apart from the Hive one of course?
02-05-2015 03:45 PM
Please, you can check this in the system requirements just as well as we can.
What functionality is covered is quite clearly described under each product (Base, DI Studio, SAS/ACCESS).
11-19-2015 03:33 PM
With just Base SAS, you can use the FILENAME statement to access data in HDFS, PROC HADOOP to interact with Hadoop data by running Apache Hadoop code, and the SPD Engine to write, retrieve, and even update data in HDFS, as well as to perform administrative functions.
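As a rough illustration of the first two Base SAS options, the sketch below creates an HDFS directory with PROC HADOOP and then writes and reads a small text file through the FILENAME statement's Hadoop access method. The paths and user ID are hypothetical. It also assumes a release that locates the cluster through the SAS_HADOOP_CONFIG_PATH environment variable; earlier 9.4 releases point to a merged configuration file with the CFG= option instead.

```sas
/* Sketch only: paths and user ID are placeholders for a test cluster. */
options set=SAS_HADOOP_JAR_PATH="/opt/sas/hadoop/jars";
options set=SAS_HADOOP_CONFIG_PATH="/opt/sas/hadoop/conf";

/* PROC HADOOP: submit HDFS commands from SAS. */
proc hadoop username='hduser' verbose;
   hdfs mkdir='/user/hduser/sas_test';
run;

/* FILENAME with the Hadoop access method: treat an HDFS file as a fileref. */
filename hdfsout hadoop '/user/hduser/sas_test/hello.txt' user='hduser';

/* Write two lines to the HDFS file. */
data _null_;
   file hdfsout;
   put 'line one from SAS';
   put 'line two from SAS';
run;

/* Read the file back and echo each line to the SAS log. */
data _null_;
   infile hdfsout;
   input;
   put _infile_;
run;
```

If both DATA steps finish without errors and the two lines appear in the log, the basic Base SAS to HDFS connection is working.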
The version of SAS 9.4 determines which Hadoop distributions are supported. This site lists the SAS 9.4 supported Hadoop distributions for several SAS products, offerings, and technologies, including the Base SAS FILENAME statement, PROC HADOOP, and the SPD Engine:
If you would like overview information for SAS and Hadoop technologies, use this document. Each overview tells you what the product is, what's required, and where to go for more detailed documentation.