Hi,
I use SAS 9.4 TS Level 1M3. Does anyone know a solution for reading Parquet files (https://en.wikipedia.org/wiki/Apache_Parquet ) by using SAS without having SAS/ACCESS Interface to Hadoop?
Thanks a lot, Sven
So how do you plan to connect to Hadoop to get access to the file in first place?
The background is that a colleague uses Spark and Scala for producing data, and he stores his results within Parquet files. I think, his setup comprises a hive server but he does not use Hadoop.
So far, his recommendation is to use a Hive or a MapR Drill ODBC driver to get access to the data via SAS/ACCESS Interface to ODBC. And my question is, if someone has experience in doing so, or if someone knows an alternative (without having SAS Hadoop).
Sorry for the amateurish explanation, but I am not a techie at all, and I do not have a good understanding of the technical infrastructure.
You need either SAS/Access to Hadoop, SAS/Access to ODBC or SAS/Access to Impala
Do you have any of these interfaces licensed?
Use the following to check your licenses, the information is in the log:
*show what's licensed;
proc setinit;run;
*what's installed;
proc product_status;
run;
SAS/ACCESS Interface to ODBC is licensed and also installed, but neither the Hadoop package nor the Impala package.
The only (?) alternative to the ODBC concept is have the parquet file exported to a format that you local SAS session can access, e.g. csv.
Join us for SAS Innovate 2025, our biggest and most exciting global event of the year, in Orlando, FL, from May 6-9. Sign up by March 14 for just $795.
Need to connect to databases in SAS Viya? SAS’ David Ghan shows you two methods – via SAS/ACCESS LIBNAME and SAS Data Connector SASLIBS – in this video.
Find more tutorials on the SAS Users YouTube channel.