<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Fuzzy reject inference in SAS Enterprise Miner in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Fuzzy-reject-inference-in-SAS-Enterprise-Miner/m-p/721713#M8582</link>
    <description>&lt;P&gt;Hi, I am trying to understand how SAS EM conducts the fuzzy method for reject inference. According to the documentation (&lt;A title="Reject Inference Node" href="https://documentation.sas.com/?docsetId=emref&amp;amp;docsetTarget=p07a3ma2a34qvqn1goqdq5y2dbfr.htm&amp;amp;docsetVersion=14.3&amp;amp;locale=en#p1ssnwymlrdzsen1v8am2yofuy24" target="_self"&gt;Reject Inference Node&lt;/A&gt;&amp;nbsp;or&amp;nbsp;&lt;A title="Reject Inference Techniques Implemented in Credit Scoring for SAS Enterprise Miner" href="https://support.sas.com/resources/papers/proceedings09/305-2009.pdf?utm_source=youtube&amp;amp;utm_medium=social-sprinklr&amp;amp;utm_content=3288772800&amp;amp;utm_term=3288772800" target="_self"&gt;Reject Inference Techniques Implemented in Credit Scoring for SAS Enterprise Miner&lt;/A&gt;), SAS EM creates two observations in the augmented data set for each original observation in the rejects data set. In the first observation, a target value of 0 is assigned. In the second observation, a target value of 1 is assigned. The two observations are then individually weighted by the posterior probabilities, P(non-event) and P(event), respectively. The posterior probabilities, P(non-event) and P(event), are estimated from the model that was trained on the accepts (or known good-bad) data set.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;A common frequency weight, called the reject weight, is then assigned to both observations to account for any over-sampling or under-sampling of the rejects data. The reject weight is computed as follows:&lt;/P&gt;&lt;DIV class="xis-equation"&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture.PNG" style="width: 272px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/55137i0655B7F08D6879E0/image-dimensions/272x75?v=v2" width="272" height="75" role="button" title="Capture.PNG" alt="Capture.PNG" /&gt;&lt;/span&gt;&lt;/DIV&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;In the above equation, Naccepts&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is the weighted number of observations in the accepts data set. That is, it is the number of observations in the accepts data set after frequency weights have been applied. Nrejects&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is the number of observations in the rejects data set; it is unweighted.&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;My question is, what is the&amp;nbsp;weighted number of observations in the accepts data set? For example, let's assume we have a dataset on bank credit card application delinquency. This data has 100 funded records, 200 approved but not funded records, and 300 rejected records. Using these numbers, is it correct that:&lt;/DIV&gt;&lt;UL&gt;&lt;LI&gt;rejection rate = 300 / (600) ?&lt;/LI&gt;&lt;LI&gt;Nrejects&amp;nbsp;&lt;FONT size="4"&gt;= 300?&lt;/FONT&gt;&lt;/LI&gt;&lt;LI&gt;What is our Naccepts&amp;nbsp;&lt;FONT size="4"&gt;in this example?&lt;/FONT&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT size="4"&gt;Thank you.&lt;/FONT&gt;&lt;/P&gt;&lt;DIV class="xis-paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;</description>
    <pubDate>Wed, 24 Feb 2021 23:06:08 GMT</pubDate>
    <dc:creator>newboy1218</dc:creator>
    <dc:date>2021-02-24T23:06:08Z</dc:date>
    <item>
      <title>Fuzzy reject inference in SAS Enterprise Miner</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Fuzzy-reject-inference-in-SAS-Enterprise-Miner/m-p/721713#M8582</link>
      <description>&lt;P&gt;Hi, I am trying to understand how SAS EM conducts the fuzzy method for reject inference. According to the documentation (&lt;A title="Reject Inference Node" href="https://documentation.sas.com/?docsetId=emref&amp;amp;docsetTarget=p07a3ma2a34qvqn1goqdq5y2dbfr.htm&amp;amp;docsetVersion=14.3&amp;amp;locale=en#p1ssnwymlrdzsen1v8am2yofuy24" target="_self"&gt;Reject Inference Node&lt;/A&gt;&amp;nbsp;or&amp;nbsp;&lt;A title="Reject Inference Techniques Implemented in Credit Scoring for SAS Enterprise Miner" href="https://support.sas.com/resources/papers/proceedings09/305-2009.pdf?utm_source=youtube&amp;amp;utm_medium=social-sprinklr&amp;amp;utm_content=3288772800&amp;amp;utm_term=3288772800" target="_self"&gt;Reject Inference Techniques Implemented in Credit Scoring for SAS Enterprise Miner&lt;/A&gt;), SAS EM creates two observations in the augmented data set for each original observation in the rejects data set. In the first observation, a target value of 0 is assigned. In the second observation, a target value of 1 is assigned. The two observations are then individually weighted by the posterior probabilities, P(non-event) and P(event), respectively. The posterior probabilities, P(non-event) and P(event), are estimated from the model that was trained on the accepts (or known good-bad) data set.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;A common frequency weight, called the reject weight, is then assigned to both observations to account for any over-sampling or under-sampling of the rejects data. The reject weight is computed as follows:&lt;/P&gt;&lt;DIV class="xis-equation"&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture.PNG" style="width: 272px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/55137i0655B7F08D6879E0/image-dimensions/272x75?v=v2" width="272" height="75" role="button" title="Capture.PNG" alt="Capture.PNG" /&gt;&lt;/span&gt;&lt;/DIV&gt;&lt;DIV class="xis-graphicAndDescription"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;In the above equation, Naccepts&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is the weighted number of observations in the accepts data set. That is, it is the number of observations in the accepts data set after frequency weights have been applied. Nrejects&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;is the number of observations in the rejects data set; it is unweighted.&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV class="xis-paragraph"&gt;My question is, what is the&amp;nbsp;weighted number of observations in the accepts data set? For example, let's assume we have a dataset on bank credit card application delinquency. This data has 100 funded records, 200 approved but not funded records, and 300 rejected records. Using these numbers, is it correct that:&lt;/DIV&gt;&lt;UL&gt;&lt;LI&gt;rejection rate = 300 / (600) ?&lt;/LI&gt;&lt;LI&gt;Nrejects&amp;nbsp;&lt;FONT size="4"&gt;= 300?&lt;/FONT&gt;&lt;/LI&gt;&lt;LI&gt;What is our Naccepts&amp;nbsp;&lt;FONT size="4"&gt;in this example?&lt;/FONT&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT size="4"&gt;Thank you.&lt;/FONT&gt;&lt;/P&gt;&lt;DIV class="xis-paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;</description>
      <pubDate>Wed, 24 Feb 2021 23:06:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Fuzzy-reject-inference-in-SAS-Enterprise-Miner/m-p/721713#M8582</guid>
      <dc:creator>newboy1218</dc:creator>
      <dc:date>2021-02-24T23:06:08Z</dc:date>
    </item>
  </channel>
</rss>

