Hi.
There's a task: I need to create workflows that load data into HDFS from many different sources, including SAS. Connecting to the other sources (Oracle, MySQL, etc.) is no problem.
But there's an issue with SAS: I don't know how to connect to this system. Where can I get a JDBC driver? And how do I create a workflow for Sqoop with SAS as the source?
1. I need some way to get data (and metadata, such as column types) from SAS. Maybe Python.
2. A way to connect from Sqoop.
The first point is more important than the second one. Please help me.
SAS provides an interface to Hadoop:
https://www.sas.com/en_be/software/access-interface-hadoop.html
It's very hard to suggest anything else without any knowledge of the whole process. Datasets are just files, so if you know the file structure you could extract the data, or convert it to CSV and load it that way; there are loads of options. I don't know Sqoop; perhaps try asking on its forums how to import files?
It's very hard to suggest anything else without any knowledge of the whole process.
I see it like this: run (select * from DDS_SRG2.my_table) to download the data, and then you can do anything with it. But first, download it.
Datasets are just files
And where are they stored? If I know the socket (host:port) and the logical location (DDS_SRG2/my_table), how do I get the file? Even without any SQL.
Don't know this sqoop, perhaps try asking on its forums how to import files?
The import itself is not the issue. The issue is getting the data out of SAS from outside.
SAS provides an interface to Hadoop: https://www.sas.com/en_be/software/access-interface-hadoop.html
What is it? Is it an app or what?
Do I need to download and install it? I don't have such permissions on my workstation.
It's very hard to suggest anything else without any knowledge of the whole process.
I see it like this: run (select * from DDS_SRG2.my_table) to download the data, and then you can do anything with it. But first, download it.
-- I have no idea what you are talking about. SAS datasets are not an SQL database; they are files stored on a network drive or locally.
Datasets are just files
And where are they stored? If I know the socket (host:port) and the logical location (DDS_SRG2/my_table), how do I get the file? Even without any SQL.
-- Sorry, what does any of that mean? SAS datasets are stored on a network drive or locally. It depends on your setup, of which you have told us nothing. Speak to your SAS people and they should be able to tell you where the data is stored.
Don't know this sqoop, perhaps try asking on its forums how to import files?
The import itself is not the issue. The issue is getting the data out of SAS from outside.
-- SAS datasets are just files. You can use the SAS system to export the data to multiple file types. It depends on what you need.
SAS provides an interface to Hadoop: https://www.sas.com/en_be/software/access-interface-hadoop.html
What is it? Is it an app or what?
Do I need to download and install it? I don't have such permissions on my workstation.
-- It's an additional component of the SAS system. If you do not know that, then your first port of call should be the SAS people at your company; they will be able to show you the system and how to get data in and out.
sqoop-import wants delimited text files (csv), so that should be the way to go. Export from SAS to csv, and import that.
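Since Python was mentioned in the question, here is a minimal sketch of the CSV route from that side. It assumes you can get a copy of the dataset file (.sas7bdat or .xpt) and that pandas is installed; `pandas.read_sas` is a real pandas function, but the helper names `load_sas_rows`, `infer_schema` and `write_delimited` are made up for illustration. The schema only distinguishes numeric from character, because those are the two storage types a SAS dataset has; SAS formats (dates, etc.) are not recovered here.

```python
import csv
import io
from numbers import Number


def load_sas_rows(path, encoding="latin-1"):
    """Read a .sas7bdat or .xpt file into a list of dicts (needs pandas)."""
    import pandas as pd  # lazy import: the helpers below work without pandas

    frame = pd.read_sas(path, encoding=encoding)
    return frame.to_dict("records")


def infer_schema(rows):
    """Classify each column as 'num' or 'char' from the values seen."""
    schema = {}
    for row in rows:
        for name, value in row.items():
            if value is None:
                continue  # missing values say nothing about the type
            is_num = isinstance(value, Number) and not isinstance(value, bool)
            kind = "num" if is_num else "char"
            # Once a column has been seen as character, it stays character.
            if schema.get(name) != "char":
                schema[name] = kind
    return schema


def write_delimited(rows, out, delimiter=","):
    """Write rows as delimited text with a header line; return the row count."""
    if not rows:
        return 0
    writer = csv.DictWriter(
        out,
        fieldnames=list(rows[0].keys()),
        delimiter=delimiter,
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return len(rows)


# Small in-memory demo; with a real file you would start from
# rows = load_sas_rows("my_table.sas7bdat")  (hypothetical file name)
demo_rows = [{"id": 1, "name": "a"}, {"id": 2.5, "name": "b"}]
demo_out = io.StringIO()
write_delimited(demo_rows, demo_out)
print(infer_schema(demo_rows))
print(demo_out.getvalue())
```

The resulting text file can then be put into HDFS with whatever tool your workflow already uses. Whether you are allowed to copy the dataset file in the first place is still a question for your SAS admins.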
SAS is not a DBMS but, as with a DBMS, you need a SAS server that is licensed, installed, running and accessible to serve you the data. SAS data is stored in proprietary SAS files on disk.
The available options for accessing SAS files with third-party tools are listed here: https://communities.sas.com/t5/SAS-Enterprise-Guide/Third-Party-BI-Tools-supporting-SAS/m-p/434828/h...
Alternatively, you can let SAS write the data to Hadoop. For this you also need the SAS Access Interface to Hadoop licensed. With such an approach you would call a SAS process (a program) from a third-party tool and then let SAS do the work (so push, not pull). I would expect this option to perform better, as it leverages the SAS Access to Hadoop interface.
It sounds like you are not an experienced SAS user.
There are SAS JDBC drivers available for download; I suggest that you consult support.sas.com to understand how to configure them. With such a driver, SAS looks like a database with tables.
But I also strongly advise you to talk to your SAS admin, power user or whoever is responsible, to work out how to make SAS data available to you.
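To give an idea of what the JDBC route could look like from the Sqoop side, here is a rough sketch that only assembles the command line. The Sqoop options used (`--connect`, `--driver`, `--table`, `--target-dir`, `--username`, `--password-file`, `--num-mappers`) are standard `sqoop import` arguments. Everything SAS-specific in the example call is an assumption to verify against the SAS driver documentation and with your admin: the URL scheme, the driver class name, the host, port, user and paths are placeholders, and `build_sqoop_import` is a hypothetical helper. Whether the SAS driver accepts the SQL that Sqoop generates for a `libref.table` name is also something you would have to test.

```python
import shlex


def build_sqoop_import(jdbc_url, driver_class, table, target_dir,
                       username, password_file, num_mappers=1):
    """Return the argument list for a 'sqoop import' over a generic JDBC driver."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--driver", driver_class,        # explicit driver class for a non-built-in source
        "--table", table,
        "--target-dir", target_dir,
        "--username", username,
        "--password-file", password_file,
        "--num-mappers", str(num_mappers),  # 1 mapper avoids needing a split column
    ]


# All values below are placeholders / assumptions, not confirmed settings.
args = build_sqoop_import(
    jdbc_url="jdbc:sasiom://sas-host.example.com:8591",  # assumed URL form
    driver_class="com.sas.rio.MVADriver",                # assumed class name
    table="DDS_SRG2.my_table",
    target_dir="/data/raw/dds_srg2/my_table",
    username="my_user",
    password_file="/user/my_user/.sas_password",
)
print(shlex.join(args))
```

The driver jar would also have to be on Sqoop's classpath, which again is something for the people who run your Hadoop and SAS installations.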
The SAS interface to Hadoop that @RW9 speaks about is mainly for SAS to push data and processing to Hadoop, so perhaps not what you are looking for?
Nevertheless, you can export data directly to the HDFS file system (using a FILENAME engine for Hadoop, which doesn't require a SAS/ACCESS to Hadoop licence), and then you could pick it up from Sqoop or whatever tool you want.