About Xamius32

DougWielenga · ‎12-04-2017

For many people viewing this thread, I suspect the answer provided by @WendyCzika will be most useful:

I think you need to move the Model Comparison node before the End Groups node, so it will pick the best model for each group. Then you should see in the Score Code in the End Groups node that there is (or could be) a different model used for each group. See this tip for more info: https://communities.sas.com/t5/SAS-Communities-Library/Tip-How-to-Build-Stratified-Models-using-the-...

For those who wish to fit a model to each subgroup but want to score all subgroups on all models, a different approach is necessary. As you have already found, Group Processing will not help you accomplish this. The only way to get the individual subgroup models is to fit them in separate paths where you have filtered out the subgroup of interest. You can then score the entire training data set using the model from each subgroup path. You would then need to merge those data sets together by the ID variable in order to obtain scores for all observations on each subgroup model.

You have to be careful because the default predicted target variable will have the same name for all subset models. When you score the whole data table on each model, you will want to create a new predicted target variable for that model so that you can tell them apart. For example, if you have a binary target BAD which takes on values 1 and 0, where 1 is the event of interest, SAS Enterprise Miner would create the prediction variable P_BAD1, which is of the form P_<target variable name><target variable level>, to store the predictions from each subset model. After scoring all the observations, create a new variable which equals the prediction variable of interest, for example

   P_Group1_Reg = P_BAD1;

and then export just the ID information and the new prediction variable. Once you have done this for each subset, you can merge the resulting data sets (containing only the ID and the new prediction variable for each observation) by the ID variable you are using. You can later merge in any additional information from the original data set, such as the actual target value and any key predictors.

If you are using partitioning, do the Data Partition node first, but be sure to stratify on both the target (if categorical) and the subgroup variable. Then create a separate path for each subgroup, using a Filter node to subset out the observations for the category of interest. After fitting the model, attach a Score node and use a new Input Data Source node which has the complete training data (having all subsets), but set the role to Score so the full data can be scored.

You can then create the new prediction variable in a SAS Code node following the Score node for each subgroup, using something like the following. This assumes the flow has a single binary target, MyID = ID variable for each observation, P_Grp1_Reg = new prediction variable (denotes Group 1 & Regression model), and you are writing to the path defined by the MyLib library:

/*** BEGIN SAS CODE ***/
libname mylib " <path to the location where you are writing out the newly scored data> ";

data mylib.grp1scores;
   set &EM_IMPORT_SCORE;
   P_Grp1_Reg=%EM_BINARY_TARGET;   * Note: assumes a single binary target is used;
   keep MyID P_Grp1_Reg;
run;

proc sort data=mylib.grp1scores;
   by MyID;   * prepare the data to be merged by MyID with the other subgroup scores;
run;
/*** END SAS CODE ***/

You could then easily merge all of the subgroup scores for the whole training data, since they would have unique prediction variable names and common ID values sorted and ready for merging, using something like the following:

/*** BEGIN SAS CODE ***/
libname mylib " <path to the location where you are writing out the newly scored data> ";

data mylib.allscores;
   merge mylib.grp1scores
         mylib.grp2scores
         mylib.grp3scores;
   by MyID;
run;
/*** END SAS CODE ***/

Hope this helps!
Doug
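
Once the merged table exists, comparing the subgroup models row by row is a short DATA step. The following is only a sketch, not part of Doug's flow: it assumes mylib.allscores was built as above and that the three prediction variables are named P_Grp1_Reg, P_Grp2_Reg and P_Grp3_Reg (the last two names are hypothetical, chosen by analogy with the first). The output names maxscores, P_Max and BestGroup are also made up for illustration.

```sas
/* Sketch: highest subgroup-model score for each observation.            */
/* Assumes the MYLIB libref is assigned and mylib.allscores exists.      */
/* P_Grp2_Reg and P_Grp3_Reg are hypothetical variable names.            */
data mylib.maxscores;
   set mylib.allscores;
   array p{*} P_Grp1_Reg P_Grp2_Reg P_Grp3_Reg;
   P_Max     = max(of p{*});            /* largest predicted probability     */
   BestGroup = whichn(P_Max, of p{*});  /* position of the first model at max */
run;
```

BestGroup is 1, 2 or 3 according to the order of the variables in the ARRAY statement, so reorder that list if a different numbering is wanted.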

CamillaGua · ‎04-24-2017

Hello Xamius32! Did you solve your problem? I'm having the same problem, but I don't know what to do. I'm using Miner 12.1. I put %let EM_INTERACTIVE_TREE_MAXOBS= 126745; in the code, but nothing changes.

Ksharp · ‎01-04-2017

You need to build the design matrix on your own and feed it into the regression analysis. http://blogs.sas.com/content/iml/2016/02/24/create-a-design-matrix-in-sas.html
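
One way to do this, sketched here under assumptions, is the OUTDESIGN= option of PROC GLMSELECT, which writes the dummy and interaction columns to a data set. The data set sashelp.class and the effects chosen (a single two-factor interaction, sex*height, next to main effects) are stand-ins for illustration, not the original poster's data or model.

```sas
/* Sketch only: sashelp.class stands in for the real training data, and  */
/* the effects (age, height, sex*height) are illustrative choices.       */
proc glmselect data=sashelp.class outdesign(addinputvars)=work.design;
   class sex;
   model weight = age height sex*height / selection=none;
run;

/* GLMSELECT stores the names of the design columns in &_GLSMOD. */
%put Design columns: &_GLSMOD;

/* Feed the design columns to a regression procedure. */
proc reg data=work.design;
   model weight = &_GLSMOD;
run;
quit;
```

Because the design uses GLM-style dummy columns, some columns are linearly dependent, so PROC REG will report that the model is not full rank; that is expected with this parameterization.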

WendyCzika · ‎12-22-2016

HP Forest and HP SVM (when using active set optimization) are the only 2 HP models that do not produce SAS DATA step score code. But the HPDMSCORECODE.sas file contains SAS code that can be used for scoring. It uses a procedure, HP4SCORE, to score the forest model. An alternative way to score your forest model in a database is using the Analytic Store files; see this tip for more information about that: https://communities.sas.com/t5/SAS-Communities-Library/SAS-High-Performance-Analytics-tip-5-Scoring-with-Analytic-Store/ta-p/253544

Xamius32 · ‎12-21-2016

From what I understand, the Ensemble node can take the average or maximum of different models from the same training data, correct? My goal is a bit different, since I am trying to take the maximum across models built on different sets of training data.

Xamius32 · ‎10-24-2016

So, I see that my Score node has two different input data sets. One has the regression train data and one has the validation data; I'm just not sure how that happened.

Tom · ‎06-25-2015

Sounds like you want to create a problem report. Typically you would include the id variables (so someone can find the offending row), then the rule and the key values that violate the rule.

data want ;
   set have ;
   length rule $50 values $50 ;
   keep id rule values ;
   if x > y then do;
      rule = 'X>Y' ;
      values=catx(' ',catx('=','X',x),catx('=','Y',y));
      output;
   end;
run;

Cynthia_sas · ‎05-13-2015

The only way to apply style to specific variables is to use multiple VAR statements. If you have multiple vars that you want to style the same, you can list them on the same VAR statement. So, for example, see the code below. You can set some overall defaults on the PROC PRINT statement, but individual variables with differing styles will need to be on separate VAR statements.
cynthia

ods tagsets.excelxp file='c:\temp\stylechange.xml' style=htmlblue;

proc print data=sashelp.shoes (obs=10)
     style(obsno)={background=pink color=navy}
     style(header)={background=pink}
     style(column)={font_face='Courier New'};
   var region / style(header)={background=darkblue color=yellow font_weight=bold}
                style(data)={background=darkblue color=yellow font_weight=bold};
   var subsidiary / style(header)={font_size=12pt font_weight=bold};
   var product / style(data)={background=lightyellow};
   var sales inventory returns / style(data)={background=grayaa font_size=12pt font_weight=bold};
run;

ods tagsets.excelxp close;

IanWakeling · ‎01-31-2015

I have a function to do exactly this. It creates the 'shift' that you mention by manipulating the row index numbers of a matrix in its sparse format (details of the format can be found in the documentation for the SPARSE function). The result, when converted back to normal form with the FULL function, is a new matrix with one row for every antidiagonal. Finally, it is a simple matter to sum the rows of this new matrix.

start AntiDiagSum( x );
   nr = nrow( x );
   nc = ncol( x );
   s = sparse( x );
   s[ , 2] = s[ , 2] + s[ , 3] - 1;
   return( full( s , nr + nc - 1, nc )[ , +] );
finish;

/* get example 3x5 matrix */
a = shape(1:15, 3, 5);
print a;

/* required sums are from the 2nd to 5th antidiagonals */
b = AntiDiagSum(a)[2:5];
print b;
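
As a cross-check on the sparse-matrix trick, the same sums can be computed with a plain double loop, using the fact that element (r, c) lies on antidiagonal number r + c - 1. This is only a verification sketch for the 3x5 example; the names sums, k and check are introduced here for illustration. Worked by hand, the 2nd to 5th antidiagonal sums of that matrix are 8, 21, 24 and 27.

```sas
proc iml;
/* Same 3x5 example matrix as in the post above. */
a  = shape(1:15, 3, 5);
nr = nrow(a);
nc = ncol(a);

/* Element (r, c) lies on antidiagonal number r + c - 1. */
sums = j(nr + nc - 1, 1, 0);
do r = 1 to nr;
   do c = 1 to nc;
      k = r + c - 1;
      sums[k] = sums[k] + a[r, c];
   end;
end;

check = sums[2:5];   /* 2nd to 5th antidiagonals */
print check;
quit;
```

The loop is slower than the sparse approach on large matrices, but it makes the indexing rule explicit.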

Xamius32 · ‎12-09-2014

Thanks a lot

Fugue · ‎08-05-2013

Are the errors showing up only on certain records, or the entire column?

Peter_C · ‎07-14-2013

If 15 joins are a problem for one query, break the task into (up to) 15 separate queries.

SteveDenham · ‎07-10-2013

It kind of depends on what coding you used for the dependent variable. Do your predicted probabilities look like (1 - expected prob)? Or are they completely mucked about? I wonder about this because of the 0 in your manual equation--I assume that it is a negative sign in your actual calculations. You might want to add XBETA=logitvalue to the OUTPUT statement to check if your difference is, in fact, due to rounding. Logitvalue should be equal to (intercept+ X*B1 +Y*B2 + Z*B3). Steve Denham
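
The check described above might look like the following sketch. The data set name have, the response resp (coded 1/0) and the predictors x, y and z are hypothetical placeholders; the output names pred, phat, p_manual and diff are likewise made up for illustration.

```sas
/* Hypothetical names: data set HAVE, binary response RESP (1/0),        */
/* predictors X Y Z.                                                     */
proc logistic data=have;
   model resp(event='1') = x y z;          /* state the event level explicitly */
   output out=pred p=phat xbeta=logitvalue;
run;

data check;
   set pred;
   p_manual = 1 / (1 + exp(-logitvalue));  /* inverse logit of the linear predictor */
   diff     = phat - p_manual;             /* should be zero up to rounding         */
run;

proc means data=check min max;
   var diff;
run;
```

If diff is essentially zero here but the hand calculation still disagrees, the discrepancy most likely comes from rounded coefficients or from modeling the other response level, in which case the manual values would look like 1 minus the predicted probability.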

Rick_SAS · ‎07-03-2013

You can build a model with all interaction terms by using the "bar notation" such as

   model y = a | b | c | d;

or restrict to only second order interactions with

   model y = a | b | c | d @2;

See the SAS/STAT(R) 9.3 User's Guide. As Data Null says, you can use variable selection to select a model that incorporates a subset of the terms listed on the model statement.
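
Combining the two ideas, bar notation plus variable selection, could be sketched as below for PROC LOGISTIC. The data set name have, the 1/0 response y and the numeric predictors a through d are hypothetical, and the entry and stay significance levels are arbitrary illustrative values, not recommendations.

```sas
/* Hypothetical names: data set HAVE, binary response Y (1/0),           */
/* numeric predictors A B C D. Categorical predictors would also need    */
/* a CLASS statement.                                                    */
proc logistic data=have;
   model y(event='1') = a | b | c | d @2
         / selection=stepwise slentry=0.05 slstay=0.05;
run;
```

The @2 limits the candidate terms to main effects and two-way interactions, and stepwise selection then keeps a subset of them.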

yeshwanth · ‎06-06-2013

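Okay, try this:

data final;
   /* A trailing colon (name-prefix list) appends every data set whose name starts with that prefix. */
   /* Assuming monthly data sets named lp_YYYYMM, this covers 200512 through 200712.                 */
   /* Note that "--" works only in variable lists, not in a SET statement.                           */
   set lp_200512 lp_2006: lp_2007:;
run;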


Re: Is there a way to only use one variable as a two-factor interactio...

Is there a way to only use one variable as a two-factor interaction va...

Re: Is there any way to export HP models scoring code?

Is there any way to export HP models scoring code?

Re: Custom model comparison criteria

Custom model comparison criteria

Re: Using start/end groups in Enterprise miner to obtain scores for ea...

Using start/end groups in Enterprise miner to obtain scores for each g...

Re: Missing observations in scored data, no missing data

Multiple inner joins taking too long, easier way?

Re: getting interactive decision tree in enterprise miner to use more ...

Re: Im trying to compare two variables and insert variable X into macr...

Re: Trying to print everything from a dataset but format/style a few o...

Re: Hi, I am new to IML, I am trying to add the reverse diagonals of e...

Re: Trying to use first. and last. processing in 3rd variable of a by ...

Re: SAS reading in date as character 9, having trouble converting

Re: Multiple inner joins taking too long, easier way?

Re: Proc logistic out and p statements

Re: Testing all combinations of a model statement (proc logistic)

Re: Joining dozens of tables with a macro