<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic True Positive is 0. Oversampling, adjusted prior, weight.  Need a help! in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149338#M1495</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a dataset with very small response rate (0.0057 or 0.57%), so I did oversampling method using several percentages from 8% to 50%.&amp;nbsp; I added decision nodes after sample nodes, and changed prior probably to my original prior probably which is 0.0057 for primary and 0.9943 for secondary. I also changed weights by clicking "Default with Inverse Prior Weights".&amp;nbsp; My ASE or Misclassification rates went up as I increased my sample size from 8% to 50%.&amp;nbsp; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My concern is the number of True Positive in my Model Comparison node.&amp;nbsp; They are all smaller than 10 regardless of my sampling or models (decision tree, Neural, logistic Regression, etc.).&amp;nbsp; After cleaning up my data, I have over 1 million rows (6445 for primary and the rest is secondary).&lt;/P&gt;&lt;P&gt;&amp;nbsp; &lt;/P&gt;&lt;P&gt;How can I do to improve my model or how should I make a decision about which model is the best if the number of True Positive is all less than 10 – many models are zeroes.&amp;nbsp; Also, my ROC charts are not curved (see below).&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt; I really appreciate for any advise. &lt;/P&gt;&lt;P&gt;&lt;IMG alt="ROC.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/6262_ROC.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Event Classification Table&lt;/P&gt;&lt;P&gt;Model Selection based on Valid: Average Profit for RESPONSE_IND (_VAPROF_)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Model&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Data&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; False&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; True&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; False&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; True&lt;/P&gt;&lt;P&gt;Node&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Model Description&amp;nbsp;&amp;nbsp; Role&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Target&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Negative&amp;nbsp; Negative&amp;nbsp; Positive&amp;nbsp; Positive&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Reg&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Reg&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Reg2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (2)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/P&gt;&lt;P&gt;Reg2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (2)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/P&gt;&lt;P&gt;Reg3&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (3)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/P&gt;&lt;P&gt;Reg3&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (3)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/P&gt;&lt;P&gt;Neural3&amp;nbsp; Neural Network (3)&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural3&amp;nbsp; Neural Network (3)&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural2&amp;nbsp; Neural Network (2)&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural2&amp;nbsp; Neural Network (2)&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural&amp;nbsp;&amp;nbsp; Neural Network&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural&amp;nbsp;&amp;nbsp; Neural Network&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Wed, 07 May 2014 19:12:39 GMT</pubDate>
    <dc:creator>kkasahar</dc:creator>
    <dc:date>2014-05-07T19:12:39Z</dc:date>
    <item>
      <title>True Positive is 0. Oversampling, adjusted prior, weight.  Need a help!</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149338#M1495</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a dataset with very small response rate (0.0057 or 0.57%), so I did oversampling method using several percentages from 8% to 50%.&amp;nbsp; I added decision nodes after sample nodes, and changed prior probably to my original prior probably which is 0.0057 for primary and 0.9943 for secondary. I also changed weights by clicking "Default with Inverse Prior Weights".&amp;nbsp; My ASE or Misclassification rates went up as I increased my sample size from 8% to 50%.&amp;nbsp; &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My concern is the number of True Positive in my Model Comparison node.&amp;nbsp; They are all smaller than 10 regardless of my sampling or models (decision tree, Neural, logistic Regression, etc.).&amp;nbsp; After cleaning up my data, I have over 1 million rows (6445 for primary and the rest is secondary).&lt;/P&gt;&lt;P&gt;&amp;nbsp; &lt;/P&gt;&lt;P&gt;How can I do to improve my model or how should I make a decision about which model is the best if the number of True Positive is all less than 10 – many models are zeroes.&amp;nbsp; Also, my ROC charts are not curved (see below).&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;P&gt; I really appreciate for any advise. &lt;/P&gt;&lt;P&gt;&lt;IMG alt="ROC.png" class="jive-image-thumbnail jive-image" src="https://communities.sas.com/legacyfs/online/6262_ROC.png" width="450" /&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Event Classification Table&lt;/P&gt;&lt;P&gt;Model Selection based on Valid: Average Profit for RESPONSE_IND (_VAPROF_)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Model&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Data&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; False&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; True&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; False&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; True&lt;/P&gt;&lt;P&gt;Node&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Model Description&amp;nbsp;&amp;nbsp; Role&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Target&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Negative&amp;nbsp; Negative&amp;nbsp; Positive&amp;nbsp; Positive&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Reg&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Reg&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Reg2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (2)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/P&gt;&lt;P&gt;Reg2&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (2)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/P&gt;&lt;P&gt;Reg3&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (3)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/P&gt;&lt;P&gt;Reg3&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regression (3)&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; .&lt;/P&gt;&lt;P&gt;Neural3&amp;nbsp; Neural Network (3)&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural3&amp;nbsp; Neural Network (3)&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural2&amp;nbsp; Neural Network (2)&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural2&amp;nbsp; Neural Network (2)&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural&amp;nbsp;&amp;nbsp; Neural Network&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; TRAIN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3223&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29002&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;P&gt;Neural&amp;nbsp;&amp;nbsp; Neural Network&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; VALIDATE&amp;nbsp; RESPONSE_IND&amp;nbsp;&amp;nbsp;&amp;nbsp; 3222&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 29003&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 May 2014 19:12:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149338#M1495</guid>
      <dc:creator>kkasahar</dc:creator>
      <dc:date>2014-05-07T19:12:39Z</dc:date>
    </item>
    <item>
      <title>Re: True Positive is 0. Oversampling, adjusted prior, weight.  Need a help!</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149339#M1496</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi KKasahar,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It seems to me that you are on the right path. Your ROC plots will be more curved when you get better models. Remember that sensitivity is a&lt;EM&gt; true positive&lt;/EM&gt; rate and specifity is a&lt;EM&gt; true negative&lt;/EM&gt; rate. &lt;/P&gt;&lt;P&gt;Some ideas to get better models. Have you tried ensembles? Try a few of the below.&lt;/P&gt;&lt;P&gt;a) Gradient Boosting. Warning: it is going to take a while to run. Run a default Gradient Boosting node right after your data, no partition, no variable selection just your data, and Gradient Boosting, what happens?&lt;/P&gt;&lt;P&gt;b) Try the ensemble of Reg2 and Reg3 using the Ensemble node&lt;/P&gt;&lt;P&gt;c) Use ensembles like bagging as described in this paper: &lt;A href="http://support.sas.com/resources/papers/proceedings14/SAS133-2014.pdf" title="http://support.sas.com/resources/papers/proceedings14/SAS133-2014.pdf"&gt;http://support.sas.com/resources/papers/proceedings14/SAS133-2014.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For anyone else reading this thread, I found a quick scoop of a rare event approach and some links to a longer answer and video.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 12.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;To create a flow that depicts a rare event case analysis in the SAS Enterprise Miner, complete the following steps:&lt;/SPAN&gt;&lt;/P&gt;&lt;OL start="1"&gt;&lt;LI&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'inherit','serif';"&gt;Create the data source. Define a target variable.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'inherit','serif';"&gt;To over-sample the rare event, add a Sample node, and attach it to the Input Data node. Set the Sample Method property to &lt;STRONG&gt;Stratify.&lt;/STRONG&gt;&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'inherit','serif';"&gt;Click the ... (ellipsis) button beside the Variables property, and choose your target variable as the stratification variable.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'inherit','serif';"&gt;In the Stratified property section, set the Criterion property to &lt;STRONG&gt;Equal.&lt;/STRONG&gt; This setting causes the sample to use all of the event observations, and an equal number of randomly selected non-event observations. Set the Oversampling Adjust Frequency property to &lt;STRONG&gt;no&lt;/STRONG&gt;.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;STRONG style="color: #1f497d;"&gt;&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG style="color: #1f497d;"&gt;For a beefier answer&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #1f497d;"&gt; Read here&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;A href="http://support.sas.com/kb/24/205.html"&gt;http://support.sas.com/kb/24/205.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="color: #1f497d;"&gt; Find a video here&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;A href="http://support.sas.com/kb/34/270.html"&gt;http://support.sas.com/kb/34/270.html&lt;/A&gt; (video in the link when you click &lt;EM&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;See the attached &lt;/SPAN&gt;&lt;A href="http://support.sas.com/kb/34/addl/fusion_34270_1_oversample.wmv"&gt;&lt;SPAN style="font-size: 10pt; font-family: Arial, sans-serif; color: #0066cc;"&gt;video&lt;/SPAN&gt;&lt;/A&gt;&lt;/EM&gt;&lt;SPAN class="apple-converted-space"&gt;&lt;EM&gt; &lt;/EM&gt;&lt;/SPAN&gt;&lt;EM style="color: #333333; font-size: 10.0pt; font-family: 'Arial','sans-serif';"&gt;for a demonstration&lt;/EM&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;I hope this helps!&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-size: 10.0pt; font-family: 'Arial','sans-serif'; color: #333333;"&gt;Miguel&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 08 May 2014 14:39:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149339#M1496</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2014-05-08T14:39:54Z</dc:date>
    </item>
    <item>
      <title>Re: True Positive is 0. Oversampling, adjusted prior, weight.  Need a help!</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149340#M1497</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Hi Miguel,&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;I really appreciate your advice.&amp;nbsp; Below is the result of your suggestions:&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;&lt;SPAN class="pasted-list-info"&gt;A)&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Gradient Boosting.&amp;nbsp;&amp;nbsp; (Data -&amp;gt; Gradient Boosting)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Result:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="0" cellpadding="0" cellspacing="0" style="width: 573px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD height="20" width="56"&gt;Model&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;&lt;/TD&gt;&lt;TD&gt;Misclassification rate&lt;/TD&gt;&lt;TD&gt;ASE&lt;/TD&gt;&lt;TD&gt;ROC&lt;/TD&gt;&lt;TD&gt;Gini Coef.&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;Y&lt;/TD&gt;&lt;TD&gt;Boost&lt;/TD&gt;&lt;TD&gt;Gradient Boosting&lt;/TD&gt;&lt;TD align="right"&gt;0.005701214&lt;/TD&gt;&lt;TD align="right"&gt;0.00566871&lt;/TD&gt;&lt;TD align="right"&gt;0.5&lt;/TD&gt;&lt;TD align="right"&gt;0&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;TABLE border="1" cellpadding="0" cellspacing="0" style="padding-bottom: 0px; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0px;" width="628"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD nowrap="nowrap" style="padding-left: 5.4pt; padding-right: 5.4pt; border: windowtext 1pt solid;" valign="top" width="56"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Model&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="128"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Model Description&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="51"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Data&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Role&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="108"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Target&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="77"&gt;&lt;P align="center" style="text-align: center;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;FALSE&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Negative&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="67"&gt;&lt;P align="center" style="text-align: center;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;TRUE&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Negative&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="76"&gt;&lt;P align="center" style="text-align: center;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;FALSE&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Positive&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" rowspan="2" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="64"&gt;&lt;P align="center" style="text-align: center;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;TRUE&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Positive&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: windowtext 1pt solid; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="56"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Node&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: windowtext 1pt solid; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="56"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Boost&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="128"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;Gradient Boosting&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="51"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;TRAIN&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="108"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;RESPONSE_IND&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="77"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;6445&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="67"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;1124016&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="76"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;0&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD nowrap="nowrap" style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="64"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 11pt;"&gt;0&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;This model classified all cases as non-response – just like other models that I built – because my response rate is (0.0057 or 6445 cases) and non-response is (0.9943 or 1124016) and True Positive and False Positive are both zero.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;&lt;SPAN class="pasted-list-info"&gt;B)&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Ensemble of Reg2 and Reg3 (using 10% oversampled data) &lt;/SPAN&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;/P&gt;&lt;TABLE border="1" cellpadding="0" cellspacing="0" style="padding-bottom: 0px; padding-left: 5.4pt; padding-right: 5.4pt; table-layout: fixed; margin-left: 0.9pt; padding-top: 0px;"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD style="padding-left: 5.4pt; padding-right: 5.4pt; border: windowtext 1pt solid;" valign="top" width="68"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Selected&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Model&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Model&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Node&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Valid:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Average&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Profit for&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;RESPONSE_IND&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="74"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Train:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Average&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Squared&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Error&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="64"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Train:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Misclassification&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Rate&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="80"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Valid:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Average&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Squared&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Error&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: windowtext 1pt solid; border-right: windowtext 1pt solid;" valign="top" width="72"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Valid:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Misclassification&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Rate&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: windowtext 1pt solid; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="68"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Y&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Reg2&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;1.71&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.090012&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.10002&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.089988&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.099984&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: windowtext 1pt solid; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="68"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Reg3&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;1.71&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.090012&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.10002&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.089988&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.099984&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: windowtext 1pt solid; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="68"&gt;&lt;P align="right" style="text-align: right;"&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;Ensmbl&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD style="border-bottom: windowtext 1pt solid; border-left: medium none; padding-left: 5.4pt; padding-right: 5.4pt; border-top: medium none; border-right: windowtext 1pt solid;" valign="top" width="69"&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;1.71&lt;/SPAN&gt;&lt;/P&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.098908&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.10002&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.098877&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;TD&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif'; color: black; font-size: 8pt;"&gt;0.099984&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;&lt;SPAN class="pasted-list-info"&gt;C)&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Bagging, Boosting, and Rotation Forest.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P style="text-indent: -0.25in; padding-left: 45px;"&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;This paper you suggested me is great!!&amp;nbsp; I am still working on this part since I have not used these methods.&amp;nbsp; I will let know.&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;&lt;STRONG&gt;Findings from my search&lt;/STRONG&gt; ( I am not sure how reliable they are)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;I was reading SAS support page and they said, “&lt;/SPAN&gt;&lt;SPAN lang="EN" style="font-family: 'Arial','sans-serif'; color: black; font-size: 10pt;"&gt;Over-weighting or under-sampling can improve predictive accuracy when there &lt;/SPAN&gt;&lt;SPAN lang="EN" style="font-family: 'Arial','sans-serif'; color: red; font-size: 10pt;"&gt;are three or more classes&lt;/SPAN&gt;&lt;SPAN lang="EN" style="font-family: 'Arial','sans-serif'; color: black; font-size: 10pt;"&gt;, including at least one rare class and two or more common classes.”&amp;nbsp; My data has two classes, so I guess it is true for my case because &lt;SPAN style="color: #333333;"&gt;my oversampling (omitting cases from common classes) is not helping to obtain the best model regardless of sample sizes.&amp;nbsp; However, when I don't put adjusted prior&amp;nbsp; (using the prior of sample data), my misclassifications increases as I increase sample size as well as the number of True Positive.&amp;nbsp; I care more about the number of True Positive rather than errors.&amp;nbsp; Is it wrong if I keep current prior probablity(not using adjusted prior)?&amp;nbsp; &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN lang="EN" style="font-family: 'Arial','sans-serif'; color: black; font-size: 10pt;"&gt;Below is the link to the website. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://support.sas.com/documentation/cdl/en/emxndg/64759/HTML/default/viewer.htm#p1w6fewo0jhzxdn1rytuk1kt0pqj.htm"&gt;&lt;SPAN lang="EN" style="font-family: 'Arial','sans-serif'; font-size: 10pt;"&gt;http://support.sas.com/documentation/cdl/en/emxndg/64759/HTML/default/viewer.htm#p1w6fewo0jhzxdn1rytuk1kt0pqj.htm&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;The other thing is how to evaluate models when you need to detect the rare event.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;OL style="list-style-type: upper-alpha;"&gt;&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN style="color: #cd0000; font-size: 9.5pt; font-family: 'Arial,BoldItalic','sans-serif';"&gt;&lt;EM&gt;Detection rate (Recall) &lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: black; font-size: 9.5pt;"&gt;- ratio between the number of correctly detected &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: mediumblue; font-size: 9.5pt;"&gt;rare events &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: black; font-size: 9.5pt;"&gt;and the total number of &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: mediumblue; font-size: 9.5pt;"&gt;rare events&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN style="color: #cd0000; font-size: 9.5pt; font-family: 'Arial,BoldItalic','sans-serif';"&gt;&lt;EM&gt;False alarm (false positive) rate &lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: black; font-size: 9.5pt;"&gt;– ratio between the number of data records from majority class that are misclassified as rare events and the total number of data records from majority class&lt;/SPAN&gt;&lt;/STRONG&gt;.&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;&lt;SPAN style="color: #cd0000; font-size: 9.5pt; font-family: 'Arial,BoldItalic','sans-serif';"&gt;&lt;EM&gt;ROC Curve &lt;/EM&gt;&lt;/SPAN&gt;&lt;SPAN style="font-family: 'Arial,Bold','sans-serif'; color: black; font-size: 9.5pt;"&gt;is a trade-off between detection rate and false alarm rate&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;so Missclassification, ASE, Average Profit , etc are &lt;/SPAN&gt;not sufficient metric for evaluation when rare event??&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Arial','sans-serif'; color: #333333; font-size: 9pt;"&gt;Here is the title of article:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;"Data Mining for Analysis of Rare Events:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P align="left"&gt;A Case Study in Security, Financial and&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Medical Applications"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks, Miguel!&amp;nbsp;&amp;nbsp; I really appreciate your help.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 08 May 2014 21:03:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/True-Positive-is-0-Oversampling-adjusted-prior-weight-Need-a/m-p/149340#M1497</guid>
      <dc:creator>kkasahar</dc:creator>
      <dc:date>2014-05-08T21:03:32Z</dc:date>
    </item>
  </channel>
</rss>

