Learn how to optimize the SAS Embedded Process and take advantage of the new Apache Spark Continuous Processing mode available in the SAS Embedded Process, by watching this video. SAS Principal Software Developer David Ghazaleh explains some of the variables to consider, including cluster size, number of cores available, size of the data and number of splits in a file.
0:18 – The situation
1:03 – Intro to SAS Viya Cloud Analytic Services (CAS) and SAS Embedded Processes
2:52 – SAS Data Connect Accelerator for Hadoop
10:33 – Spark application components
16:44 – What’s the best setting for SAS Data Connect Accelerator for Hadoop?
Introduction to SAS and Hadoop (training course)
SAS Data Connector to Hadoop (documentation)
More SAS Data Management SAS Communities articles
Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 16. Read more here about why you should contribute and what is in it for you!
Data Literacy is for all, even absolute beginners. Jump on board with this free e-learning and boost your career prospects.