<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Binning a set of Continuous Variables using Percentiles for WOE Transformation &amp; Logistic Regression in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/276760#M55485</link>
    <description>&lt;DIV class="gmail_default"&gt;I have a SAS Programming Problem that you may have already solved:&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;My Data Set contains three sets of continuous variables:&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;DQ01 - DQ59 &amp;nbsp;DE01 - DE59 &amp;nbsp;&amp;amp; &amp;nbsp;DL01 - DL59.&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;( 177 variables ) Each standardised with Mean = 50 and Variance &amp;nbsp;= 100&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;The basic Statistical problem is Binary Logistic Regression.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;1. I want to bin each continuous variable using deciles or semi-deciles&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp; that have been computed using PROC Univariate / Summary.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;2. Compute and output the Percentiles for each Variable.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;3. For each variable compare the observed values with the Percentile&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp; Cut-Points and then allocate that observation to a Decile Bin.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;4. Optimise the Bin Allocation based on a metric such as the GINI.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;5. Apply a Robust WOE Transformation to each Binned Variable.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;subject to the following constraints:&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;a. The % frequency within each bin &amp;gt; 5%&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;b. The WOE transformation is Monotonic&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;6. Fit a Binary Logistic Regression Model to the WOE-Transformed Variables.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;If you have any advice or suggestions w.r.t. the above please let me know.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;Regards&lt;/DIV&gt;</description>
    <pubDate>Sun, 12 Jun 2016 13:22:58 GMT</pubDate>
    <dc:creator>JonDickens1607</dc:creator>
    <dc:date>2016-06-12T13:22:58Z</dc:date>
    <item>
      <title>Binning a set of Continuous Variables using Percentiles for WOE Transformation &amp; Logistic Regression</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/276760#M55485</link>
      <description>&lt;DIV class="gmail_default"&gt;I have a SAS Programming Problem that you may have already solved:&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;My Data Set contains three sets of continuous variables:&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;DQ01 - DQ59 &amp;nbsp;DE01 - DE59 &amp;nbsp;&amp;amp; &amp;nbsp;DL01 - DL59.&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;( 177 variables ) Each standardised with Mean = 50 and Variance &amp;nbsp;= 100&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;The basic Statistical problem is Binary Logistic Regression.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;1. I want to bin each continuous variable using deciles or semi-deciles&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp; that have been computed using PROC Univariate / Summary.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;2. Compute and output the Percentiles for each Variable.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;3. For each variable compare the observed values with the Percentile&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp; Cut-Points and then allocate that observation to a Decile Bin.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;4. Optimise the Bin Allocation based on a metric such as the GINI.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;5. Apply a Robust WOE Transformation to each Binned Variable.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;subject to the following constraints:&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;a. The % frequency within each bin &amp;gt; 5%&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp; &amp;nbsp;b. The WOE transformation is Monotonic&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;6. Fit a Binary Logistic Regression Model to the WOE-Transformed Variables.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;If you have any advice or suggestions w.r.t. the above please let me know.&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="gmail_default"&gt;Regards&lt;/DIV&gt;</description>
      <pubDate>Sun, 12 Jun 2016 13:22:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/276760#M55485</guid>
      <dc:creator>JonDickens1607</dc:creator>
      <dc:date>2016-06-12T13:22:58Z</dc:date>
    </item>
    <item>
      <title>Re: Binning a set of Continuous Variables using Percentiles for WOE Transformation &amp; Logistic Re</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/282838#M57535</link>
      <description>I think this is a little too big for a forum post. Also, you posted it twice, in two different forums.</description>
      <pubDate>Fri, 08 Jul 2016 00:26:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/282838#M57535</guid>
      <dc:creator>JBerry</dc:creator>
      <dc:date>2016-07-08T00:26:13Z</dc:date>
    </item>
    <item>
      <title>Re: Binning a set of Continuous Variables using Percentiles for WOE Transformation &amp; Logistic Re</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/282897#M57566</link>
      <description>&lt;P&gt;My main problem was how to process a large number of variables using the same binning algorithm.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have constructed a solution for the binning process for a single variable using PROC RANK.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now I need a macro, possibly using arrays, that enables me to repeat the process for every variable and combine the output into a single table.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Has this reduced the problem sufficiently?&lt;/P&gt;</description>
      <pubDate>Fri, 08 Jul 2016 10:31:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Binning-a-set-of-Continuous-Variables-using-Percentiles-for-WOE/m-p/282897#M57566</guid>
      <dc:creator>JonDickens1607</dc:creator>
      <dc:date>2016-07-08T10:31:43Z</dc:date>
    </item>
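    <item>
      <title>Sketch: binning many variables at once with PROC RANK, then WOE by bin</title>
      <description>&lt;P&gt;The repetition asked about in this thread may not need a macro at all: PROC RANK accepts a list of variables, so a single call can decile-bin all 177 of them. What follows is a minimal, untested sketch rather than a definitive solution. The data set name HAVE, the 0/1 response variable TARGET, and the b-prefixed rank variables are assumptions that do not come from the thread. It covers only percentile binning and a plain WOE per bin. It does not attempt the GINI-based optimisation, the 5% minimum bin frequency, or the monotonicity constraint. The sign convention used here, log(% non-events / % events), is one common choice. Reverse it if your convention differs.&lt;/P&gt;&lt;PRE&gt;
/* ASSUMED names: WORK.HAVE is the input data set, TARGET is the 0/1 response. */

/* Step 1: decile-bin every variable in one PROC RANK call.                    */
/* GROUPS=10 gives bins 0-9; use GROUPS=20 for semi-deciles.                   */
proc rank data=have groups=10 out=binned;
   var   DQ01-DQ59   DE01-DE59   DL01-DL59;
   ranks bDQ01-bDQ59 bDE01-bDE59 bDL01-bDL59;   /* hypothetical bin variables  */
run;

/* Step 2: stack the bins into long form, one row per observation per variable. */
data long (keep=target varname bin);
   set binned;
   array b {*} bDQ01-bDQ59 bDE01-bDE59 bDL01-bDL59;
   length varname $32;
   do i = 1 to dim(b);
      varname = substr(vname(b{i}), 2);   /* drop the "b" prefix               */
      bin     = b{i};
      output;
   end;
run;

/* Step 3: event and non-event counts per variable and bin, then WOE.          */
proc sql;
   create table woe as
   select a.varname, a.bin, a.n_event, a.n_nonevent,
          log( (a.n_nonevent / t.tot_nonevent)
             / (a.n_event    / t.tot_event) ) as woe
   from (select varname, bin,
                sum(target = 1) as n_event,
                sum(target = 0) as n_nonevent
         from long
         group by varname, bin) as a,
        (select sum(target = 1) as tot_event,
                sum(target = 0) as tot_nonevent
         from have) as t
   order by a.varname, a.bin;
quit;
&lt;/PRE&gt;&lt;P&gt;WORK.WOE then holds one row per variable and bin, which is the combined table asked for. A bin with no events or no non-events gets a missing WOE, which can serve as a signal to merge it with an adjacent bin before fitting the logistic regression.&lt;/P&gt;</description>
    </item>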
  </channel>
</rss>

