Watch this video to learn how to optimize the SAS Embedded Process and take advantage of its new Apache Spark Continuous Processing mode. SAS Principal Software Developer David Ghazaleh explains the variables to consider, including cluster size, the number of cores available, the size of the data, and the number of splits in a file.
0:18 – The situation
1:03 – Intro to SAS Viya Cloud Analytic Services (CAS) and SAS Embedded Processes
2:52 – SAS Data Connect Accelerator for Hadoop
10:33 – Spark application components
16:44 – What’s the best setting for SAS Data Connect Accelerator for Hadoop?
Introduction to SAS and Hadoop (training course)
SAS Data Connector to Hadoop (documentation)
More SAS Data Management SAS Communities articles