SvenWagner
Fluorite | Level 6

Hi, 

 

I use SAS 9.4 TS Level 1M3. Does anyone know a way to read Parquet files (https://en.wikipedia.org/wiki/Apache_Parquet) in SAS without having SAS/ACCESS Interface to Hadoop?

 

Thanks a lot, Sven

8 REPLIES 8
Patrick
Opal | Level 21

So how do you plan to connect to Hadoop to get access to the file in the first place?

SvenWagner
Fluorite | Level 6

The background is that a colleague uses Spark and Scala to produce data, and he stores his results in Parquet files. I think his setup includes a Hive server, but he does not use Hadoop.

 

So far, his recommendation is to use a Hive or a MapR Drill ODBC driver to get access to the data via SAS/ACCESS Interface to ODBC. My question is whether anyone has experience doing so, or knows an alternative (without having SAS/ACCESS Interface to Hadoop).

 

Sorry for the amateurish explanation, but I am not a techie at all, and I do not have a good understanding of the technical infrastructure.

Patrick
Opal | Level 21

You need either SAS/ACCESS to Hadoop, SAS/ACCESS to ODBC, or SAS/ACCESS to Impala.

Do you have any of these interfaces licensed?

vipinj765
Calcite | Level 5
Hi,
I have a locally downloaded copy of a Parquet file on a Linux server. Instead of connecting to a Hadoop cluster, I want to read the local version. Is there any way to do this in SAS 9.4M6?
Reeza
Super User

Use the following to check your licenses; the information is written to the log:

* show what's licensed;
proc setinit; run;

* show what's installed;
proc product_status; run;
SvenWagner
Fluorite | Level 6

SAS/ACCESS Interface to ODBC is licensed and also installed, but neither the Hadoop nor the Impala interface is.
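
With SAS/ACCESS Interface to ODBC available, the route via a Hive or Drill ODBC driver could look roughly like the sketch below. It assumes an ODBC data source has already been configured for the driver; the DSN name, schema, and table name are hypothetical placeholders, not details from this thread.

```sas
/* Assumption: a DSN called "HiveDSN" exists for the Hive (or Drill) ODBC driver. */
/* The DSN, schema, and table names below are hypothetical.                       */
libname pq odbc dsn="HiveDSN" schema=default;

/* List the tables visible through the ODBC connection */
proc datasets lib=pq;
run;
quit;

/* Copy one table (backed by the Parquet files) into the WORK library */
data work.results;
    set pq.results_table;
run;

/* Release the connection when done */
libname pq clear;
```

Whether the Parquet data shows up as a table depends on how the colleague has registered it on the Hive or Drill side.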

LinusH
Tourmaline | Level 20

The only (?) alternative to the ODBC approach is to have the Parquet file exported to a format that your local SAS session can access, e.g. CSV.
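
Once such a CSV export exists, reading it needs only Base SAS. A minimal sketch, assuming the file has a header row; the path and dataset name are hypothetical:

```sas
/* Assumption: the Parquet data was exported to this CSV file (hypothetical path). */
proc import datafile="/data/exports/results.csv"
            out=work.results
            dbms=csv
            replace;
    getnames=yes;        /* first row holds the column names */
    guessingrows=1000;   /* scan more rows before deciding column types and lengths */
run;
```

Column types are guessed from the scanned rows, so a DATA step with an explicit INPUT statement may be preferable when the types are known.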

Data never sleeps
LinusH
Tourmaline | Level 20
Just want to add: if you create a CSV file, it doesn't have to be moved to your local SAS server.
You can use FILENAME HADOOP to access the file in HDFS.
PROC HADOOP lets you submit HDFS commands or execute Pig programs, which can come in handy when converting the Parquet file.
None of these requires a separate SAS/ACCESS licence.
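
A rough sketch of both statements, assuming the Hadoop client JARs and cluster configuration files have been copied to the SAS server. All directories, HDFS paths, the user name, and the column layout are hypothetical, and the exact options needed can differ by SAS release and cluster setup:

```sas
/* Assumption: Hadoop JARs and config files live in these hypothetical directories. */
options set=SAS_HADOOP_JAR_PATH="/opt/sas/hadoop/jars";
options set=SAS_HADOOP_CONFIG_PATH="/opt/sas/hadoop/conf";

/* Point a fileref at a CSV file stored in HDFS (hypothetical path and user) */
filename hcsv hadoop "/user/sven/results.csv" user="sven";

/* Read the file directly from HDFS, skipping the header row */
data work.results;
    infile hcsv dsd firstobs=2 truncover;
    input id value :$50.;   /* hypothetical columns */
run;

/* Alternatively, copy the file from HDFS to the local file system */
proc hadoop username="sven" verbose;
    hdfs copytolocal="/user/sven/results.csv" out="/tmp/results.csv";
run;
```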
Data never sleeps
