For large data management projects it’s not uncommon for customers to deliver long and detailed requirements for data access and data integration. Data source access options for SAS data management offerings are plentiful and well documented but information on data integration options—the means for software applications to communicate and share data—are not as easy to come by (at least not all in one place).
So let’s forget about data access options for now (you have lots of them!) and focus on data integration. SAS customers are fortunate to have several data management applications to choose from that each offer a distinct set of features and play well with each other and other non-SAS systems. Choosing the right technology for your data management project will depend on, among other things, the integration options provided by each. Sometimes a combination of data management applications might be needed to match the requirements of the project and the varied skills of potential users. Here’s a quick guide to help you make the right technology choice for your project.
The Data Management Platform (DMP) includes Data Management Studio, Data Management Server, and several other additional modules and content libraries. Taken together, these components provide a robust environment for developing and deploying data quality-centric processes. Unique to this environment is access to sophisticated data profiling, data quality and data enrichment algorithms; the ability to deploy processes in batch and real-time modes; and an adroitness with heterogeneous IT environments. Here are just a few of the data integration features of this solution:
Data Integration Studio is visual design tool for building and deploying data integration processes. Its distinctive features are its SAS code generation underpinnings, a multitude of built-in transformations, close integration with SAS metadata and lots of other enterprise ETL capabilities. Much of what you can do in Data Integration Studio can be done with manual SAS programming but you get little of the code manageability and none of the rich graphical user interface that Data Integration Studio provides. Among its features are these integration capabilities:
Visual Process Orchestration is a web application that can tie together various SAS code-based and non SAS code-based data integration and data management processes in a visual data workflow environment. It is differentiated by its web-based user interface and built-in logical data flow processing. It also nicely spans the Data Management Platform and SAS code-based data management technologies. Visual Process Orchestration has the following integration options:
You can see that these applications have many ways to communicate with other technologies and with each other. You could for example construct processes that interact with each other like this:
I’ve only scratched the surface of the deep set of features provide by the applications discussed here. Understanding the technology options you have for data management and the interplay among these applications is the first step in making the right choice for your project.
Do you have any projects to share where you had to use two or more of these technologies in an innovative way? Did you use the integration features listed here or maybe a few that I missed?