
Achieving Optimal Performance with the SAS Data Connect Accelerator for Hadoop

Started 07-16-2020
Modified 07-16-2020

Watch this video to learn how to optimize the SAS Embedded Process and take advantage of its new Apache Spark Continuous Processing mode. SAS Principal Software Developer David Ghazaleh explains the variables to consider, including cluster size, the number of available cores, the size of the data, and the number of splits in a file.

 

 

Video Highlights

0:18 – The situation

1:03 – Intro to SAS Viya Cloud Analytic Services (CAS) and SAS Embedded Processes

2:52 – SAS Data Connect Accelerator for Hadoop

10:33 – Spark application components

16:44 – What’s the best setting for SAS Data Connect Accelerator for Hadoop?

 

Read the Paper


Related Resources

Introduction to SAS and Hadoop (training course)
SAS Data Connector to Hadoop (documentation)
More SAS Data Management SAS Communities articles

 

