BookmarkSubscribeRSS Feed

Arrow/Parquet in SAS Compute

Started ‎10-02-2023 by
Modified ‎11-03-2023 by
Views 191

If you want to use SAS to perform data analysis but want your data stored in the Parquet on-disk format rather than the traditional sas7bdat format, this presentation is for you. Parquet is an open source on-disk format that has been traditionally utilized in a Hadoop context but can exist outside a Hadoop cluster. Arrow is the in-memory variant of the associated technology. Both are columnar data stores and yield more efficient data query performance. For these reasons, Arrow/Parquet has gained popularity throughout the data analytics community. Learn about the Arrow/Parquet feature that you can use while continuing to conduct analysis with SAS.

 

Presentation slides are attached. See also Parquet Support in SAS Compute Server.

Version history
Last update:
‎11-03-2023 12:59 PM
Updated by:

SAS INNOVATE 2024

Innovate_SAS_Blue.png

Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.

If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website. 

Register now!

Article Tags

SAS Explore 2023 presentations are now available! (Also indexed for search at lexjansen.com!)

View all available SAS Explore content by category: