<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Weight of evidence in New SAS User</title>
    <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553364#M9362</link>
    <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I have a question about weight of evidence for credit score modeling. I found that researcher are using the two contrasting methods to calculate the WOE.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Probability of default: Assuming that the probability of default is event (y=1), I found that WOE is ln(distr of good/distr of bad) in Siddiqi book, but some are using just opposite like this: ln(distr of bad/distr of good).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. For credit card application: Assuming the credit card approval (y=1),&amp;nbsp;WOE is ln(distr of bad/distr of good). Is it right or just opposite?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Either way, do we get the same IV value?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SAS book says that WOE depends upon how to define the event or non-event and provides this log ratio: ln(% non event / % event)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I got really confused? HELP??&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Bikash&lt;/P&gt;</description>
    <pubDate>Tue, 23 Apr 2019 17:40:50 GMT</pubDate>
    <dc:creator>bikashten</dc:creator>
    <dc:date>2019-04-23T17:40:50Z</dc:date>
    <item>
      <title>Weight of evidence</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553364#M9362</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I have a question about weight of evidence for credit score modeling. I found that researcher are using the two contrasting methods to calculate the WOE.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;1. Probability of default: Assuming that the probability of default is event (y=1), I found that WOE is ln(distr of good/distr of bad) in Siddiqi book, but some are using just opposite like this: ln(distr of bad/distr of good).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;2. For credit card application: Assuming the credit card approval (y=1),&amp;nbsp;WOE is ln(distr of bad/distr of good). Is it right or just opposite?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Either way, do we get the same IV value?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;SAS book says that WOE depends upon how to define the event or non-event and provides this log ratio: ln(% non event / % event)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I got really confused? HELP??&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Bikash&lt;/P&gt;</description>
      <pubDate>Tue, 23 Apr 2019 17:40:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553364#M9362</guid>
      <dc:creator>bikashten</dc:creator>
      <dc:date>2019-04-23T17:40:50Z</dc:date>
    </item>
    <item>
      <title>Re: Weight of evidence</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553370#M9363</link>
      <description>&lt;P&gt;It really doesn't matter.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you use&amp;nbsp;&lt;SPAN&gt;ln(distr of good/distr of bad) then big numbers are good, and small numbers are bad.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;If you use&amp;nbsp;ln(distr of bad/distr of good) then big numbers are bad, and small numbers are good.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;So which do you prefer, big numbers are good, or small numbers are good?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 23 Apr 2019 17:49:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553370#M9363</guid>
      <dc:creator>PaigeMiller</dc:creator>
      <dc:date>2019-04-23T17:49:44Z</dc:date>
    </item>
    <item>
      <title>Re: Weight of evidence</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553374#M9364</link>
      <description>&lt;P&gt;Hi Paige,&lt;/P&gt;&lt;P&gt;These two statement are not always right:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;If you use&amp;nbsp;ln(distr of good/distr of bad) then big numbers are good, and small numbers are bad.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;If you use&amp;nbsp;ln(distr of bad/distr of good) then big numbers are bad, and small numbers are good.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For WOE, I am looking for the ratio of %of good to %of bad, not total number of good/total number of bad for a particular category. I also thought the same thing: it does not matter which way we formulate the WOE, but we will get the same IV value for predicting its performance. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;For the sake of easy interpretation, I have seen a couple of papers using this formula too:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class="mrow"&gt;&lt;SPAN class="msubsup"&gt;&lt;SPAN class="mtext"&gt;WOE&lt;/SPAN&gt;&lt;SPAN class="texatom"&gt;&lt;SPAN class="mi"&gt;i&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;j&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;=&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;log(&lt;/SPAN&gt;&lt;SPAN class="mfrac"&gt;&lt;SPAN class="mi"&gt;P&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;(&lt;/SPAN&gt;&lt;SPAN class="msubsup"&gt;&lt;SPAN class="mi"&gt;X&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;j&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;∈&lt;/SPAN&gt;&lt;SPAN class="msubsup"&gt;&lt;SPAN class="mi"&gt;B&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;i&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="texatom"&gt;&lt;SPAN class="mo"&gt;|&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;Y&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;=&lt;/SPAN&gt;&lt;SPAN class="mn"&gt;1&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;)/&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;P&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;(&lt;/SPAN&gt;&lt;SPAN class="msubsup"&gt;&lt;SPAN class="mi"&gt;X&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;j&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;∈&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="msubsup"&gt;&lt;SPAN class="mi"&gt;B&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;i&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="texatom"&gt;&lt;SPAN class="mo"&gt;|&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class="mi"&gt;Y&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;=&lt;/SPAN&gt;&lt;SPAN class="mn"&gt;0&lt;/SPAN&gt;&lt;SPAN class="mo"&gt;)).&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class="mrow"&gt;&lt;SPAN class="mfrac"&gt;&lt;SPAN class="mo"&gt;Thanks,&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class="mrow"&gt;&lt;SPAN class="mfrac"&gt;&lt;SPAN class="mo"&gt;Bikash&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 23 Apr 2019 18:18:18 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553374#M9364</guid>
      <dc:creator>bikashten</dc:creator>
      <dc:date>2019-04-23T18:18:18Z</dc:date>
    </item>
    <item>
      <title>Re: Weight of evidence</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553641#M9399</link>
      <description>&lt;P&gt;For your first Q:&lt;/P&gt;
&lt;P&gt;Both are right. They just have +beta or -beta .&lt;/P&gt;
&lt;P&gt;All you need is checking the high score group should have lower bad percent.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;ln(distr of good/distr of bad) in Siddiqi book,&amp;nbsp; &amp;nbsp;---&amp;gt;&amp;nbsp; model good_bad(event='bad')=&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt; but some are using just opposite like this: ln(distr of bad/distr of good).&lt;/SPAN&gt; ---&amp;gt;&amp;nbsp; &amp;nbsp;model good_bad(event='good')=&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;For your second Q:&lt;/P&gt;
&lt;P&gt;"&lt;SPAN&gt;WOE is ln(distr of bad/distr of good).&amp;nbsp;"&amp;nbsp; &amp;nbsp; &amp;nbsp; should be&amp;nbsp; &amp;nbsp; &amp;nbsp;"model y(event='0')=&amp;nbsp; ".&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;But you need check if the higher score have the lower bad percent. If not ,then switch into "model y(event='1')= "&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;"Either way, do we get the same IV value?"&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Yes. Both have the same IV .&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 24 Apr 2019 13:30:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/553641#M9399</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-04-24T13:30:06Z</dc:date>
    </item>
    <item>
      <title>Re: Weight of evidence</title>
      <link>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/554990#M9594</link>
      <description>&lt;P&gt;Thanks Ksharp for your clarification.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Apr 2019 12:18:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/New-SAS-User/Weight-of-evidence/m-p/554990#M9594</guid>
      <dc:creator>bikashten</dc:creator>
      <dc:date>2019-04-30T12:18:31Z</dc:date>
    </item>
  </channel>
</rss>

