BookmarkSubscribeRSS Feed
Tenno1
Calcite | Level 5

I developed an algorithm that uses the chi-squared test to perform supervised discretization of a continuous variable. I described it in the paper "ChiD-A Chi-Squared Discretization Algorithm" published in the WUSS 2011 Proceedings available at http://www.wuss.org/proceedings11/

The stopping criterion is not very intelligent, and I would like to know if there are better ways of stopping the discretization process.

2 REPLIES 2
art297
Opal | Level 21

I should begin by admitting that I am not a statistician and am not familiar with either the method you are using or with IML.

That said, when I have confronted situations where I needed to incorporate a somewhat intelligent stopping point, I found it useful to apply a rather brute force approach, namely to wrap the code within a macro that uses a binary decision tree to test various criteria until an acceptible limit is reached.

Of course, if you are asking what such a criterion might be, please just ignore this post.

Rick_SAS
SAS Super FREQ

Probably a good general approach. In IML you don't even need to use a macro. In IML you can just wrap the code in a module definition and call the module at each step of an iterative method.

For example, if you're trying to find a zero, see http://blogs.sas.com/content/iml/2011/08/03/finding-the-root-of-a-univariate-function/ Or, if you're trying to optimize some criterion, see http://blogs.sas.com/content/iml/2011/10/12/maximum-likelihood-estimation-in-sasiml/

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 2 replies
  • 1381 views
  • 0 likes
  • 3 in conversation