Hi All, I have some few questions on some Enterprise Miner nodes. Hope you can help me. (1) Regression Node - Logistic Regression - I noticed that it doesn't generate R-squared values and the usual goodness of fit statistics, like what the Proc Logistic does. I thought of using the SAS Code node instead to get the complete set of stats through Proc logistic. But before I switch to that maybe you have some ideas how to check it in the EM results page/s? Maybe I missed it out. - Other option I thought of is the HP Regression node which generates the Hosmer-Lemeshow, 3 R-square values and Somers' D by default. However, the results of this node are not as user-friendly as Regression node (ie, Beta estimates, odds ratios etc). I can just use it though just to get those stats. I have yet to check if it gives me the same result as Regression node - in terms of coefficients and significant variables. Does anybody have any thoughts on this? (2) HP Regression Node - Logistic Regression - Just wanted to make sure that the values under "PARTITION FIT STATISTICS" for Hosmer-lemeshow is the statistic itself and not the p-value? I tried to search for the technical documentation in Eminer Help and even on the internet but it doesn't give me the specifics - just the formula. (3) Cutoff Node - After developing my logistic regression model, I wanted to play around and analyze different cutoffs. But when I tried to do so, it gives me a runtime error everytime. Upon checking the logs, it shows: "ERROR: Undeclared array referenced: symputx". - The only different thing I did for this model development is that I did oversampling due to rare event (0.78%) and then added Decisions node to adjust priors to ensure adjusment in my predicted probabilities. I don't see any reasons why it should create some error though. Would appreciate any thoughts and suggestions. Thanks in advance!
... View more