Hi,
I read that SAS Data Loader for H
adoop include the following 4 solutions:
My question is: To access and extract data from Hadoop we need all the four components, or we just use SAS Data Loader for Hadoop?
Because I'm studiyng how SAS can work with data from Hadoop and already see that can I:
I don't know If I'm thinking correclty...
Data Loader for Haddop is a package, so there are overlapping in functionality.
Data Loader is a single user environments in the current release, but that shouldn't be a problem for you?
If you have existing data in Hive, the basic license is SAS/ACCESS fro Hadoop.
Then you can use SQL and other stuff to extract and analyze data.
Then there's a question of how you wish to analyze/browse the data - these requirements are needed to chose the appropriate SAS tools.
I think your scope is somewhat complex.
This means that you need to
Bottom line is that you may need on-site guidance on how to architect your environment. I think SAS should assist you on this (especially on haw to use a subset of components in the "Loader" product compared to the separate modules). Or are you trying to get a second opinion?
Hi LinusH,
I'm just doing a research about how can SAS could exract some insights from Hadoop. I don't have any SAS license at this time, because is just a research program for my Master Thesis.
I've amount of data in Hadoop (some files with a large amount of data in HDFS) and I create with Hive some new tables to do some segmentations to reduce the quantity of data.
What I want now is available what are the options to explore my data with SAS to extract some insights. For that I need to access the Data in Hadoop using SAS (that's Why all my questions above because I seeing amount of options with the same goal). I already read that I can extract the data directly from HDFS to SAS (don't know what are the pre-requisites) or I can use SAS Data Loader for Hadoop.
If you have a amount of Data in HDFS, Hive or Hbase, whick solution do you use to extract into SAS. Or, is a better option, read directly to Hadoop via SAS?
Hope I have explained better.
Thanks for your help!
Data Loader for Haddop is a package, so there are overlapping in functionality.
Data Loader is a single user environments in the current release, but that shouldn't be a problem for you?
If you have existing data in Hive, the basic license is SAS/ACCESS fro Hadoop.
Then you can use SQL and other stuff to extract and analyze data.
Then there's a question of how you wish to analyze/browse the data - these requirements are needed to chose the appropriate SAS tools.
LinusH,
sorry only more one question:
If i said:
If we want to storage the data into our SAS machine we can use SAS/ACCESS for Hadoop or just Base (SAS). iF we want to analyze and explore the data in a SAS Application (lik SAS Visual Analytics) we use SAS SPDS because it's included on it structure.
Is this thinking wrong?
"Store the data into our SAS machine we can use SAS/ACCESS for Hadoop or just Base (SAS)."
If you want to benefit from the Hive metastore, you need SAS/ACCESS to Hadoop.
"Analyze and explore the data in a SAS Application (like SAS Visual Analytics) we use SAS SPDS"
SPDS is a great SAS data store. But it's not required for Visual Analytics. But loading data to the LASR server might go faster if you use SPDS (compared to base SAS data sets) - given the same physical conditions.
SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!
Need to connect to databases in SAS Viya? SAS’ David Ghan shows you two methods – via SAS/ACCESS LIBNAME and SAS Data Connector SASLIBS – in this video.
Find more tutorials on the SAS Users YouTube channel.