BookmarkSubscribeRSS Feed

Arrow/Parquet in SAS Compute

Started ‎10-02-2023 by
Modified ‎11-03-2023 by
Views 430

If you want to use SAS to perform data analysis but want your data stored in the Parquet on-disk format rather than the traditional sas7bdat format, this presentation is for you. Parquet is an open source on-disk format that has been traditionally utilized in a Hadoop context but can exist outside a Hadoop cluster. Arrow is the in-memory variant of the associated technology. Both are columnar data stores and yield more efficient data query performance. For these reasons, Arrow/Parquet has gained popularity throughout the data analytics community. Learn about the Arrow/Parquet feature that you can use while continuing to conduct analysis with SAS.

 

Presentation slides are attached. See also Parquet Support in SAS Compute Server.

Version history
Last update:
‎11-03-2023 12:59 PM
Updated by:

Ready to join fellow brilliant minds for the SAS Hackathon?

Build your skills. Make connections. Enjoy creative freedom. Maybe change the world. Registration is now open through August 30th. Visit the SAS Hackathon homepage.

Register today!
Article Tags

SAS Explore 2023 presentations are now available! (Also indexed for search at lexjansen.com!)

View all available SAS Explore content by category: