turn on suggestions

Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.

Showing results for

Find a Community

- Home
- /
- Analytics
- /
- Stat Procs
- /
- Stopping rule for chi-squared discretization algor...

Topic Options

- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

10-28-2011 10:41 AM

I developed an algorithm that uses the chi-squared test to perform supervised discretization of a continuous variable. I described it in the paper "ChiD-A Chi-Squared Discretization Algorithm" published in the WUSS 2011 Proceedings available at http://www.wuss.org/proceedings11/

The stopping criterion is not very intelligent, and I would like to know if there are better ways of stopping the discretization process.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

10-28-2011 11:04 AM

I should begin by admitting that I am not a statistician and am not familiar with either the method you are using or with IML.

That said, when I have confronted situations where I needed to incorporate a somewhat intelligent stopping point, I found it useful to apply a rather brute force approach, namely to wrap the code within a macro that uses a binary decision tree to test various criteria until an acceptible limit is reached.

Of course, if you are asking what such a criterion might be, please just ignore this post.

- Mark as New
- Bookmark
- Subscribe
- Subscribe to RSS Feed
- Highlight
- Email to a Friend
- Report Inappropriate Content

10-28-2011 11:22 AM

Probably a good general approach. In IML you don't even need to use a macro. In IML you can just wrap the code in a module definition and call the module at each step of an iterative method.

For example, if you're trying to find a zero, see http://blogs.sas.com/content/iml/2011/08/03/finding-the-root-of-a-univariate-function/ Or, if you're trying to optimize some criterion, see http://blogs.sas.com/content/iml/2011/10/12/maximum-likelihood-estimation-in-sasiml/