ChiD, a χ²-Based Discretization Algorithm

Started ‎02-16-2022 by
Modified ‎02-16-2022 by

I published a paper at the 2011 Washington Users of SAS Software meeting on discretizing a continuous variable using the chi-squared statistic, and I am sharing it here with the SAS community.

In the paper I describe the taxonomy of discretization algorithms, give an example of ChiMerge (a predecessor of ChiD, the algorithm that I developed), and present comparative results between ChiD and the SAS Enterprise Miner 4.3 bucketing algorithm. From that comparison I conclude that ChiD generates cutsets of similar quality to those computed by the Enterprise Miner decision tree algorithm.
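For readers who have not seen ChiMerge, here is a minimal Python sketch of the bottom-up merging idea as it is usually described (Kerber, 1992). It is not the ChiD algorithm and not the attached SAS code. The function names, the default threshold of 3.841 (the 0.95 chi-squared quantile for one degree of freedom, which fits a two-class target), the optional `max_intervals` argument, and the choice to skip cells with a zero expected count are all assumptions made for illustration.

```python
from collections import Counter


def pair_chi2(a, b, classes):
    """Chi-squared statistic for the 2 x k table formed by two adjacent intervals.

    a and b are Counters mapping class label -> count.
    """
    n = sum(a.values()) + sum(b.values())
    chi2 = 0.0
    for row in (a, b):
        row_total = sum(row.values())
        for c in classes:
            expected = row_total * (a[c] + b[c]) / n
            if expected > 0:  # assumption: skip classes absent from both intervals
                chi2 += (row[c] - expected) ** 2 / expected
    return chi2


def chimerge(values, labels, threshold=3.841, max_intervals=None):
    """Return the sorted cut points (interval lower bounds) found by merging.

    Start with one interval per distinct value, then repeatedly merge the
    adjacent pair with the smallest chi-squared statistic until every pair
    reaches the threshold (and, if given, at most max_intervals remain).
    """
    classes = sorted(set(labels))
    counts = {}
    for v, y in zip(values, labels):
        counts.setdefault(v, Counter())[y] += 1
    lows = sorted(counts)               # lower bound of each interval
    tables = [counts[v] for v in lows]  # class counts of each interval

    while len(tables) > 1:
        chi2s = [pair_chi2(tables[i], tables[i + 1], classes)
                 for i in range(len(tables) - 1)]
        i = min(range(len(chi2s)), key=chi2s.__getitem__)
        too_many = max_intervals is not None and len(tables) > max_intervals
        if chi2s[i] >= threshold and not too_many:
            break
        tables[i] = tables[i] + tables[i + 1]  # merge the most similar pair
        del tables[i + 1], lows[i + 1]
    return lows[1:]


# Perfectly separated toy data: a single cut at 5 survives the merging.
print(chimerge([1, 2, 3, 4, 5, 6, 7, 8], [0, 0, 0, 0, 1, 1, 1, 1]))  # [5]
```

Each returned cut point is the lower bound of an interval, so the toy data above is split into [1, 5) and [5, 8]. A higher threshold merges more aggressively and yields fewer intervals.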

 

The conference paper and the SAS code for the ChiD algorithm are included as attachments.
