Your reasoning as to why range scaling is needed is correct - variables with larger ranges will dominate a nearest neighbor approach. The MBR node does not do any range scaling of your data, so you will need to handle this portion of the process external to the MBR node. You can range scale interval data by using the Transform Variables Node, and for the "Interval Inputs" property, select "Range." As for correlated variables, there are two answers... the MBR can weight based off of correlation with the target, but the MBR will NOT handle correlation between input variables. Each input variable is weighted by the absolute value of the correlation to the target variable. This will only apply in cases of interval targets or binary targets (nominal targets will multiple levels will NOT have a weighting based off of correlation). The "Weighted" property on the MBR Node controls whether you want weighting or not. This is different than your question, which I think is asking about correlated input variables. You are correct that having highly correlated input variables will skew the nearest neighbor results towards favoring the underlying mechanism. This is most likely a problem though, only if the variables are highly correlated. Highly correlated input variables affect more methods than just the MBR, so I would recommend that you always consider handling correlated variables. You can use the StatExplore node in Enterprise Miner to determine correlations, and then use the Metadata Node to reject some of these correlated variables. I hope this helps!
... View more