BookmarkSubscribeRSS Feed
🔒 This topic is solved and locked. Need further help from the community? Please sign in and ask a new question.
karendean
Calcite | Level 5

proc dmdb batch data=sampsio.assocs out=dmassoc dmdbcat=catassoc; id customer; class product(desc); run;

proc assoc data=sampsio.assocs dmdbcat=catassoc out=datassoc(label='Output from Proc Assoc') items=5 support=20;cust customer; target product; run;

what is the purpose of the first line

why can't I just use proc assoc

1 ACCEPTED SOLUTION

Accepted Solutions
DougWielenga
SAS Employee

There are two things to be aware of with the ASSOC procedure (and other non-HP SAS data mining procedures). 

    1 - The procedures are designed to work against data mining data bases, not SAS data sets

    2 - The procedures are designed to be used by SAS Enterprise Miner, not directly called in SAS code

 

The name of the DMDB procedure refers to the Data Mining Data Base that is produces.  Standard SAS procedures are optimized to run against SAS data sets which include the data plus some information about each variable (e.g. type of data -- character or numeric, formats, lengths, etc...).   In standard SAS procedures, you always need to specify how each variable is being used in each procedure.   SAS Enterprise Miner has a concept called metadata which tells you additional information about the variable.  For example, it has more information about each variable's data type (e.g. binary, nominal, ordinal, nominal, text, freq, etc...)  and each variable's role (ID, Input, Target, Segment, etc...).   A Data Mining Data Base is simply a SAS data set with an associated catalog which contains this additional information.  The benefit of the DMDB is that the software allows you to define how a variable should be used and then it carries that information through each node/proc so that you don't need to specify it each time.  You can also change the metadata which updates the DMDB between processes.   However, you cannot run a standard SAS data set into a non-HP data mining procedure running in SAS Enterprise Miner. 

 

Hope this helps!

Doug

View solution in original post

2 REPLIES 2
ChrisHemedinger
Community Manager

I moved this discussion to the Data Mining community.  PROC DMDB creates a SAS catalog that has metadata about the characteristics of your data, which is required input for other data mining procedures.  One of our data mining experts might be able to provide more information.

Chris

It's time to register for SAS Innovate! Join your SAS user peers in Las Vegas on April 16-19 2024.
DougWielenga
SAS Employee

There are two things to be aware of with the ASSOC procedure (and other non-HP SAS data mining procedures). 

    1 - The procedures are designed to work against data mining data bases, not SAS data sets

    2 - The procedures are designed to be used by SAS Enterprise Miner, not directly called in SAS code

 

The name of the DMDB procedure refers to the Data Mining Data Base that is produces.  Standard SAS procedures are optimized to run against SAS data sets which include the data plus some information about each variable (e.g. type of data -- character or numeric, formats, lengths, etc...).   In standard SAS procedures, you always need to specify how each variable is being used in each procedure.   SAS Enterprise Miner has a concept called metadata which tells you additional information about the variable.  For example, it has more information about each variable's data type (e.g. binary, nominal, ordinal, nominal, text, freq, etc...)  and each variable's role (ID, Input, Target, Segment, etc...).   A Data Mining Data Base is simply a SAS data set with an associated catalog which contains this additional information.  The benefit of the DMDB is that the software allows you to define how a variable should be used and then it carries that information through each node/proc so that you don't need to specify it each time.  You can also change the metadata which updates the DMDB between processes.   However, you cannot run a standard SAS data set into a non-HP data mining procedure running in SAS Enterprise Miner. 

 

Hope this helps!

Doug

sas-innovate-2024.png

Don't miss out on SAS Innovate - Register now for the FREE Livestream!

Can't make it to Vegas? No problem! Watch our general sessions LIVE or on-demand starting April 17th. Hear from SAS execs, best-selling author Adam Grant, Hot Ones host Sean Evans, top tech journalist Kara Swisher, AI expert Cassie Kozyrkov, and the mind-blowing dance crew iLuminate! Plus, get access to over 20 breakout sessions.

 

Register now!

How to choose a machine learning algorithm

Use this tutorial as a handy guide to weigh the pros and cons of these commonly used machine learning algorithms.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 3506 views
  • 0 likes
  • 3 in conversation