BookmarkSubscribeRSS Feed

What’s new in SAS Data Loader for Hadoop 2.4

Started ‎01-11-2016 by
Modified ‎01-15-2016 by
Views 1,467

SAS Data Loader for Hadoop 2.4, generally available on Monday, January 11, 2016, includes features that seek to achieve three goals:


  1. Speed up data management processes with Spark
  2. Improve productivity of data management professionals 
  3. Manage data where it lives

Below is a summary of what’s new in the 2.4 release. For more details, please see the SAS Data Loader for Hadoop 2.4 User’s Guide

Speed up data management processes with Spark

  • Improved performance using Spark and ImpalaNew support for Spark brings massively parallel in-memory processing to the following directives: Cleanse Data, Transform Data and Cluster-Survive. Impala can now be leveraged in the following directives: Query or Join, Sort and De-Duplicate and Run a Hadoop SQL Program (formerly called “Run a Hive Program”).
  • Increased performance of profiling jobs


Improve productivity of data management professionals

  • Improved syntax editing
  • Chain directivesCreate a data flow that uses two or more saved directives which can be executed in serial or in parallel.
  • New “Match-Merge” directiveUse the new “Match-Merge” directive to append columns from multiple source tables into a single target table. Column data values can also be updated when rows match in two or more source tables.
  • New “Cluster-Survive” directiveThe new “Cluster-Survive” directive leverages user-defined rules to create clusters of similar records. Additional user-defined rules can be created to construct a survivor record that will replace the cluster of rows in the target table.
  • New “Delete Rows” directive


Manage data where it lives

  • Added support for IBM BigInsights and Pivotal HD
  • Expanded support to now include VirtualBox and VMWare Hypervisors
  • Schedule jobs using a REST APIA REST API can now be used to schedule and execute saved directives. The API can also return the job’s state, results, log file or error messages, along with being able to cancel running jobs and delete job information.
  • Apply and reload Hadoop configuration changes


New trial version

Download a free trial version of SAS Data Loader for Hadoop, to be installed on a production Hadoop cluster. This can be converted into a production license without reinstalling the software.





when I download DL trial, it is the old one - DL 2.2. I am doing something wrong? Is it possible to make things work with downloading DL 2.4 vApp Cloudera and Cloudera Quickstart VM 5.3?

Version history
Last update:
‎01-15-2016 11:09 AM
Updated by:



Registration is open! SAS is returning to Vegas for an AI and analytics experience like no other! Whether you're an executive, manager, end user or SAS partner, SAS Innovate is designed for everyone on your team. Register for just $495 by 12/31/2023.

If you are interested in speaking, there is still time to submit a session idea. More details are posted on the website. 

Register now!

Free course: Data Literacy Essentials

Data Literacy is for all, even absolute beginners. Jump on board with this free e-learning  and boost your career prospects.

Get Started

Article Labels
Article Tags