BookmarkSubscribeRSS Feed

Arrow/Parquet in SAS Compute

Started ‎10-02-2023 by
Modified ‎11-03-2023 by
Views 529

If you want to use SAS to perform data analysis but want your data stored in the Parquet on-disk format rather than the traditional sas7bdat format, this presentation is for you. Parquet is an open source on-disk format that has been traditionally utilized in a Hadoop context but can exist outside a Hadoop cluster. Arrow is the in-memory variant of the associated technology. Both are columnar data stores and yield more efficient data query performance. For these reasons, Arrow/Parquet has gained popularity throughout the data analytics community. Learn about the Arrow/Parquet feature that you can use while continuing to conduct analysis with SAS.

 

Presentation slides are attached. See also Parquet Support in SAS Compute Server.

Version history
Last update:
‎11-03-2023 12:59 PM
Updated by:

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

Article Tags

SAS Explore 2023 presentations are now available! (Also indexed for search at lexjansen.com!)

View all available SAS Explore content by category: