About jpindfw

jpindfw · ‎11-30-2015

Thanks, Miguel! We are still on 12.1, so I did find them under the same subdirectory, in the file named OUTDESCRIBE.txt. I am still working on the report, but I think it would be a great addition to SAS reporting of decision trees. I should be able to share it with you soon.

RalphAbbey · ‎09-02-2014

Your reasoning as to why range scaling is needed is correct: variables with larger ranges will dominate a nearest neighbor approach. The MBR node does not do any range scaling of your data, so you will need to handle this portion of the process outside the MBR node. You can range scale interval data by using the Transform Variables Node and, for the "Interval Inputs" property, selecting "Range."

As for correlated variables, there are two answers. The MBR can weight based on correlation with the target, but the MBR will NOT handle correlation between input variables.

Each input variable is weighted by the absolute value of its correlation with the target variable. This applies only to interval targets or binary targets (nominal targets with multiple levels will NOT have a weighting based on correlation). The "Weighted" property on the MBR Node controls whether you want weighting or not.

This is different from your question, which I think is asking about correlated input variables. You are correct that having highly correlated input variables will skew the nearest neighbor results towards favoring the underlying mechanism. This is most likely a problem, though, only if the variables are highly correlated. Highly correlated input variables affect more methods than just the MBR, so I would recommend that you always consider handling correlated variables. You can use the StatExplore node in Enterprise Miner to determine correlations, and then use the Metadata Node to reject some of these correlated variables.

I hope this helps!
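To make the two ideas above concrete outside of Enterprise Miner, here is a minimal Python sketch of range scaling followed by a distance weighted by absolute correlation with the target. It is an illustration only, not the MBR node's actual implementation: the function names, the toy data, and the choice to multiply each squared difference by its weight are all assumptions made for the example.

```python
import math

def range_scale(column):
    """Map values onto [0, 1]; a constant column maps to all zeros."""
    lo, hi = min(column), max(column)
    span = hi - lo
    return [0.0 if span == 0 else (v - lo) / span for v in column]

def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def weighted_distance(a, b, weights):
    """Euclidean distance with a per-variable weight on each squared
    difference (an assumed weighting scheme, for illustration)."""
    return math.sqrt(sum(w * (x - y) ** 2 for x, y, w in zip(a, b, weights)))

# Hypothetical toy data: two interval inputs and a binary target.
income = [20000.0, 40000.0, 60000.0, 80000.0]
age = [25.0, 60.0, 30.0, 55.0]
target = [0.0, 1.0, 0.0, 1.0]

# Without scaling, income differences swamp age differences.
raw_rows = list(zip(income, age))
raw_d = weighted_distance(raw_rows[0], raw_rows[1], [1.0, 1.0])

# Range scale each input, then weight by |correlation with target|.
scaled_cols = [range_scale(income), range_scale(age)]
weights = [abs(pearson(col, target)) for col in scaled_cols]
rows = list(zip(*scaled_cols))
scaled_d = weighted_distance(rows[0], rows[1], weights)

print("weights:", weights)
print("raw distance:", raw_d, "scaled+weighted distance:", scaled_d)
```

In this toy data, age tracks the target more closely than income, so it receives the larger weight once both inputs are on the same [0, 1] scale. Note that the weights say nothing about correlation between the inputs themselves; that still has to be screened separately, for example with the same `pearson` helper applied to pairs of input columns.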

CraigDeVault · ‎07-25-2014

Please open up a support track by contacting SAS Technical Support at support@sas.com. I will be happy to help you with your question regarding SAS Sentiment Analysis. Thanks.

Paige · ‎03-05-2010

PROC CAPABILITY can fit a wide variety of distributions, including the Johnson family of distributions, which would allow you to fit very long tailed distributions.
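As a rough illustration of why the Johnson family suits long tails, the sketch below shows the Johnson SU transformation in plain Python. It is not a substitute for the fitting that PROC CAPABILITY performs and does not estimate any parameters; the parameter values, function names, and sample size are assumptions chosen for the example. In the SU form, z = gamma + delta * asinh((x - xi) / lambda) is standard normal, so inverting it turns normal draws into a heavy-tailed sample.

```python
import math
import random

def johnson_su_to_normal(x, gamma, delta, xi, lam):
    """Johnson SU transform: maps x to a standard-normal score z."""
    return gamma + delta * math.asinh((x - xi) / lam)

def johnson_su_from_normal(z, gamma, delta, xi, lam):
    """Inverse transform: maps a standard-normal z to a Johnson SU value."""
    return xi + lam * math.sinh((z - gamma) / delta)

def sample_kurtosis(values):
    """Plain (non-excess) sample kurtosis; about 3 for normal data."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / (m2 ** 2)

# Hypothetical parameters; a smaller delta gives heavier tails.
params = dict(gamma=0.0, delta=1.0, xi=0.0, lam=1.0)

rng = random.Random(0)
normal_draws = [rng.gauss(0.0, 1.0) for _ in range(10000)]
su_draws = [johnson_su_from_normal(z, **params) for z in normal_draws]

print("kurtosis of normal draws:", sample_kurtosis(normal_draws))
print("kurtosis of Johnson SU draws:", sample_kurtosis(su_draws))
```

The Johnson SU sample shows a much larger kurtosis than the normal draws it was generated from, which is the kind of tail behavior the post refers to.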

Date Last Visited	‎10-03-2017 01:45 PM

Re: SAS EM Decision Tree English Rules

SAS EM Decision Tree English Rules

SAS EM and Memory Based Reasoning

Sentiment Analysis Package - starter sets of pos/neg words/comments

Long Tail Distributions

Re: SAS EM and Memory Based Reasoning

Re: Sentiment Analysis Package - starter sets of pos/neg words/comment...

Re: Long Tail Distributions