BookmarkSubscribeRSS Feed

Achieving Optimal Performance with the SAS Data Connect Accelerator for Hadoop

Started ‎07-16-2020 by
Modified ‎07-16-2020 by
Views 2,164

Learn how to optimize the SAS Embedded Process and take advantage of the new Apache Spark Continuous Processing mode available in the SAS Embedded Process, by watching this video. SAS Principal Software Developer David Ghazaleh explains some of the variables to consider, including cluster size, number of cores available, size of the data and number of splits in a file.

 

 

Video Highlights

0:18 – The situation

1:03 – Intro to SAS Viya Cloud Analytic Services (CAS) and SAS Embedded Processes

2:52 – SAS Data Connect Accelerator for Hadoop

10:33 – Spark application components

16:44 – What’s the best setting for SAS Data Connect Accelerator for Hadoop?

 

Read the Paper


Related Resources

Introduction to SAS and Hadoop (training course)
SAS Data Connector to Hadoop (documentation)
More SAS Data Management SAS Communities articles

 

Version history
Last update:
‎07-16-2020 03:44 PM
Updated by:
Contributors

hackathon24-white-horiz.png

The 2025 SAS Hackathon Kicks Off on June 11!

Watch the live Hackathon Kickoff to get all the essential information about the SAS Hackathon—including how to join, how to participate, and expert tips for success.

YouTube LinkedIn

SAS AI and Machine Learning Courses

The rapid growth of AI technologies is driving an AI skills gap and demand for AI talent. Ready to grow your AI literacy? SAS offers free ways to get started for beginners, business leaders, and analytics professionals of all skill levels. Your future self will thank you.

Get started

Article Tags