Arrow/Parquet in SAS Compute

Started 10-02-2023
Modified 11-03-2023

If you want to analyze data with SAS but store it in the Parquet on-disk format rather than the traditional sas7bdat format, this presentation is for you. Parquet is an open-source on-disk format that has traditionally been used with Hadoop but can also be used outside a Hadoop cluster. Arrow is the in-memory counterpart of the same technology. Both are columnar formats, which generally makes analytical queries more efficient because a query can read only the columns it needs. For these reasons, Arrow/Parquet has gained popularity throughout the data analytics community. Learn about the Arrow/Parquet feature in SAS Compute, which lets you work with these formats while continuing to conduct your analysis with SAS.

Presentation slides are attached. See also Parquet Support in SAS Compute Server.
