11-15-2015 08:41 AM
Hi everybody, I'm quite stuck with the following issue with Enterprise Miner 13.2.
I've got two similar dataset (around 200.000 obs and 300 variables), one for training purpose and the other for testing (same variables obviously, just the target is missing in the test dataset).
Long story short, I built and compared 5 different models: I can use any of them for scoring, except the "best" one, a gradient boosting model. After a few minutes attempting to score just a tiny sample of the test dataset (100 observations) I get this message: "System error - Program halted in server workspace method".
I've got another simpler GB model (fewer iterations, smaller depth) that I can use successfully to score.
11-15-2015 11:59 AM
Below a quick thing to try. I have another couple of ideas depending how complex your diagram is.
If this is urgent, please also submit a Tech support form to confirm what's going on.
What happens if you add just a Score node to the "Best" Boosting model. Just Boosting->Score, do not connect your scoring data, does that run OK? If yes, you can use the optimized score code to give it a try to manually score your test data.
How Complex is your diagram? can you share a screenshot? If there are no transformations or new variables in that flow, then you might use the EMFlowscore that your gradient boosting node has already produced.
Start with these two ideas and let's figure out what's happening with that node of yours.
11-16-2015 01:37 PM
First, thanks for using SAS. My name is Jason Xin, analytics solution architect focused on financial services.
I was helping another customer when I saw your post. I searched SAS.com's technical support and found this support Note that I think is relevant to your error message.
"Problem Note 15720: Enterprise Miner nodes fail when Server Initialization code contains a syntax error"
If you can share with us what your log says, that may be very helpful. Or have you tried the Optimized code? Thanks.
11-16-2015 02:36 PM
thanks for the prompt responses.
I tried just the boosting -> score without the scoring dataset, still the same error.
And the diagram is really easy: training data, partitioning, models, model compare and then score.
I have yet to try the EMFlowscore.
As for the problem note 15720, I checked View->Project Log for strings like "error", "not valid" and such, no luck.
Jason, you were asking for the Project Log or some other log?
By the way, as the simpler models run with no problem whatsoever, could it be something related to memory allocation or model "size"?
11-17-2015 11:43 AM - edited 11-17-2015 02:10 PM
Since your diagram is simple and your "Best" gradient boosting node is running OK, let's try to use the score code manually.
First, identify the ID of your gradient boosting node (click on it, and on top of their properties find the Node ID e.g. Boost3). Do the same for your workspace, click anywhere blank on your diagram and find the ID (e.g. EMWS1).
Next, locate the physical path of your workspace. One way to do it: click on the explorer icon, select Show Project Data, and confirm the physical path.
Now go to base SAS and use code similar to the below. I mocked up my test data, but you can start on the section Score manually.
/*Create a test set. To mock it up, I dropped the target variable*/ data test; set sampsio.hmeq; drop bad; run; /*Score manually*/ %let path=D:\EM\EM_Projects\EM14.1\mim\demo\Workspaces\EMWS1; %let nodeid=Boost3; filename emflow "&path.\&nodeid.\EMPUBLISHSCORE.sas"; data test; set test; %inc emflow; run;
Hopefully scoring manually gets you somewhere?
Also, if you have time, send me the log of the Score node that is failing. When you are on the results, go to View-SAS Results-> log. Do CTRL+F to try to find "error: ". You can use direct message if that is better, or through tech support.
11-17-2015 02:01 PM
11-17-2015 02:11 PM
11-17-2015 05:09 PM
11-17-2015 10:14 PM