If you want to use SAS to perform data analysis but want your data stored in the Parquet on-disk format rather than the traditional sas7bdat format, this presentation is for you. Parquet is an open source on-disk format that has been traditionally utilized in a Hadoop context but can exist outside a Hadoop cluster. Arrow is the in-memory variant of the associated technology. Both are columnar data stores and yield more efficient data query performance. For these reasons, Arrow/Parquet has gained popularity throughout the data analytics community. Learn about the Arrow/Parquet feature that you can use while continuing to conduct analysis with SAS.
Presentation slides are attached. See also Parquet Support in SAS Compute Server.
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!
SAS Explore 2023 presentations are now available! (Also indexed for search at lexjansen.com!)
View all available SAS Explore content by category: