<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Use of oversampling and cut-off node results interpretation in SAS E-Miner. in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Use-of-oversampling-and-cut-off-node-results-interpretation-in/m-p/100346#M830</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a dataset with target to non target (target variable is binary) proportion as 20:80 with some variables which are to be used in decision tree analysis. Count of data rows is around 20,000. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To start with, I ran the decision tree with no adjusted priors. However, I couldn't see any results from the run (no tree map with subsequent partitions, no cumulative lift chart etc). Hence I thought of trying out an oversampling such that the new proportion of target to non target is now 50:50. The decision tree this time threw up results unlike the earlier scenario. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My questions are: &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1) Is there a reason why the tree output with no adjusted priors wasn't created ? I am using the default decision tree node settings itself.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;2) How to decide on the cut-off (percent of Y or 1) for the overall tree, so that I can pick and choose the important leaf nodes? Or should it be be fixed at 50% (as I have oversampled data to 50:50 from 20:80) , such that all leaf nodes above 50% have higher probability of prediction for target = Y or 1. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;3) Cut-off node usage: I read in a tech paper that cut-off node can be attached to any model node (where target is binary) and the actual cut-off can be determined from some tables of its results. Any leads on how these are being read. Is there a way to ascertain which cut-off other than 50% is better for my model.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;4) Is there a choice between deciding a cut-off basis oversampling (pt # 2) or from the results of cut-off node ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Kindly advise. &lt;/P&gt;&lt;P&gt;,&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sat, 08 Jun 2013 07:10:56 GMT</pubDate>
    <dc:creator>ajosh</dc:creator>
    <dc:date>2013-06-08T07:10:56Z</dc:date>
    <item>
      <title>Use of oversampling and cut-off node results interpretation in SAS E-Miner.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Use-of-oversampling-and-cut-off-node-results-interpretation-in/m-p/100346#M830</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a dataset with target to non target (target variable is binary) proportion as 20:80 with some variables which are to be used in decision tree analysis. Count of data rows is around 20,000. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;To start with, I ran the decision tree with no adjusted priors. However, I couldn't see any results from the run (no tree map with subsequent partitions, no cumulative lift chart etc). Hence I thought of trying out an oversampling such that the new proportion of target to non target is now 50:50. The decision tree this time threw up results unlike the earlier scenario. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My questions are: &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1) Is there a reason why the tree output with no adjusted priors wasn't created ? I am using the default decision tree node settings itself.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;2) How to decide on the cut-off (percent of Y or 1) for the overall tree, so that I can pick and choose the important leaf nodes? Or should it be be fixed at 50% (as I have oversampled data to 50:50 from 20:80) , such that all leaf nodes above 50% have higher probability of prediction for target = Y or 1. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;3) Cut-off node usage: I read in a tech paper that cut-off node can be attached to any model node (where target is binary) and the actual cut-off can be determined from some tables of its results. Any leads on how these are being read. Is there a way to ascertain which cut-off other than 50% is better for my model.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;4) Is there a choice between deciding a cut-off basis oversampling (pt # 2) or from the results of cut-off node ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Kindly advise. &lt;/P&gt;&lt;P&gt;,&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 08 Jun 2013 07:10:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Use-of-oversampling-and-cut-off-node-results-interpretation-in/m-p/100346#M830</guid>
      <dc:creator>ajosh</dc:creator>
      <dc:date>2013-06-08T07:10:56Z</dc:date>
    </item>
    <item>
      <title>Re: Use of oversampling and cut-off node results interpretation in SAS E-Miner.</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Use-of-oversampling-and-cut-off-node-results-interpretation-in/m-p/388272#M5829</link>
      <description>&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;There are several issues involved here that need to be separated in order to provide a clearer understanding. &amp;nbsp; &amp;nbsp;For categorical target variables, by default SAS Enterprise Miner assigns the observation to the most likely target level based on the predicted value stored in a variable of the form&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;P_&amp;lt;target variable name&amp;gt;&amp;lt;target variable level&amp;gt; &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;Using the SAMPSIO.HMEQ data set (which is available by clicking on &lt;STRONG&gt;Help&lt;/STRONG&gt; --&amp;gt; &lt;STRONG&gt;Generate Sample Data Sources...&lt;/STRONG&gt; inside Enterprise Miner and adding the &lt;STRONG&gt;Home Equity&lt;/STRONG&gt; data) as an example, there is a categorical target variable named BAD which has levels 1 and 0. SAS Enterprise Miner generates several variables from any modeling node and in this case it would create the variables &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp;P_BAD1 &amp;nbsp;- &amp;nbsp;the probability BAD=1 &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;P_BAD0 &amp;nbsp;- &amp;nbsp;the probability BAD=0&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;These probabilities will reflect the training data by default so the probabilities of the rare event will be inflated if you oversample so that there is a higher proportion of observations with BAD=1 (the rare event) in the sample than there is in the population. &amp;nbsp;&amp;nbsp;If you are only concerned about the the predicted outcome, you can simply adjust the cutoff probabilities later using a Cutoff node to get the desired proportion of the data classified as events.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;Should you be more interested in the actual probabilities themselves (rather than just the ordering of the observations from most likely to least likely) and wish to have the probability scores reflect values closer to the original population rather than the training data, you can accomplish this by creating a Target profile in the Input Data Source node. &amp;nbsp;&amp;nbsp;A Target profile allows you to adjust the prior probability and the weight/value attached to correctly predicting each outcome. &amp;nbsp;Adjusting the prior probability for an oversampled target will adjust the probability scores to be centered closer to the overall population average you provide. &amp;nbsp;Depending on which criteria you are using for choosing the model, it might also be useful to apply additional weight/value to correctly predicting the rare event. &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;By default, SAS Enterprise Miner defines two variables for a grouping variable target.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp; F_ &amp;lt;variable name&amp;gt;: &amp;nbsp;the target level for each observation&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp; I _ &amp;lt;variable name&amp;gt;: &amp;nbsp;the predicted target level based on the fitted model (based on most likely outcome)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;When you request to use decision weights in your target profile, SAS Enterprise Miner will create a decis&lt;/SPAN&gt;&lt;SPAN class="cs1B16EEB5"&gt;ion variable of the form&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp; D _ &amp;lt;target variable name&amp;gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;with the predicted outcome based on choosing the most profitable (or least costly) outcome from the product of the predicted probability and the decision weight for each level. &amp;nbsp; I_&amp;lt;variable name&amp;gt; and D_&amp;lt;variable name&amp;gt; provide reasonable approaches in many situations but in rare event scenarios, the I_ variable will likely predict too few people as having the event and the D_ variable will predict too many. &amp;nbsp;As a result, I generally advise people to take their business objectives into consideration in order to choose a cutoff for their particular data set. &amp;nbsp; Without specifying decisions weights, you might end up with a tree with no branches if none of the leaves represent a higher probability for the rare event. &amp;nbsp;As a result, it is often helpful to specify your priors and decision weights.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;It is easy to accomplish this task by following the instructions in &lt;/SPAN&gt;&lt;SPAN class="csDC4A80"&gt;Usage Note 47965: Using priors and decision weights in SAS® Enterprise Miner(tm)&lt;/SPAN&gt;&lt;SPAN class="cs1B16EEB5"&gt; which is available at&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;A class="csB4F56505" href="http://support.sas.com/kb/47/965.html" target="new"&gt;&lt;SPAN class="cs777ED152"&gt;http://support.sas.com/kb/47/965.html&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;where it shows the following:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;/*** BEGIN USAGE NOTE 47965 EXCERPT ***/&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs887C2D5B"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;Data mining problems routinely involve situations where one target level is more "rare" than others. By default, SAS Enterprise Miner assigns the most likely outcome as the predicted outcome. This assignment results in decision rules that strongly favor the common outcome, which is usually not of interest. The assignment often generates models with no predicted events of interest.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;If you specify priors, then the posterior probabilities are adjusted, but the adjustment might lead to no variables selected. Even if a model is successfully fit, the predicted outcome might be the common target level.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;Example: an event occurs 1% of the time. A person who is 10 times as likely to have the event, still has only a 10% chance of having the event. &amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;You can change this prediction outcome by modifying the default decision weights. Edit the default decision weights either in a Decisions node, or in an Input Data node. &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;To edit the default decision weights in the Input Data node, follow these steps:&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL start="1"&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="csAEFEEABF"&gt;Click the Input Data node.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="csAEFEEABF"&gt;Click the ellipsis (&lt;/SPAN&gt;&lt;SPAN class="cs489B9A2F"&gt;...&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;) to the right of the Decisions property.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="csAEFEEABF"&gt;Click&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="cs489B9A2F"&gt;Build&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;to create a target profile.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="csAEFEEABF"&gt;Click the&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="cs489B9A2F"&gt;Decisions&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;tab.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="cs49B1FA62"&gt;&lt;SPAN class="cs760720A3"&gt;Click&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="cs94D94857"&gt;Default with Inverse Prior Weights&lt;/SPAN&gt;&lt;SPAN class="cs760720A3"&gt;. This selection enables you to find variables that are useful predictors.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="csAEFEEABF"&gt;Click&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="cs489B9A2F"&gt;Decision Weights&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;to see that the values changed from their default values.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI class="csC3FAB670"&gt;&lt;SPAN class="cs489B9A2F"&gt;OK&lt;/SPAN&gt;&lt;SPAN class="csAEFEEABF"&gt;.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P class="cs239C706E"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;To determine the amount of weight to assign to the rare event in a binary target, calculate this ratio:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs1642BFDB"&gt;&lt;SPAN class="csC63A3429"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;probability of the common event&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs1642BFDB"&gt;&lt;SPAN class="csC63A3429"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;ratio = ---------------------------------&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs1642BFDB"&gt;&lt;SPAN class="csC63A3429"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;probability of the rare event &amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs1642BFDB"&gt;&lt;SPAN class="csC63A3429"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="csAEFEEABF"&gt;Specify the weight on the rare event to be equal to this ratio. For example, if you have a binary event where Prob(Yes)=0.1 and Prob(No)=0.9, then the ratio of the common event to the rare event is 0.9/0.1 = 9. Change the weight for Yes from the default of 1 to the value 9 in the Decision Weights tab. If your rare event is much more rare, for example 2%, then the ratio is 0.98/0.2 = 49. If you have an event that occurs much less than 1% of the time, then you might get better results by over-sampling, and then adjusting the probabilities later. Even if you over-sample, the priors adjust the probabilities, but the predicted outcome is the common event (if you do not modify the decision weights).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;The choice of the predicted-probability value to choose as the cutoff for predicting an event or non-event relies on business expertise. In the case of a rare event, it is common to focus only on the predictions in the small range of values for which action is taken. A model that always predicts that the event is the common outcome gives the outcome as often as the common event occurs in the data (example: 95% of the time). SAS Enterprise Miner provides an automated choice that is based on the decision weights that you provide. If these weights do not represent how you expect to implement the results, then focus on the ordering of the probabilities, choose your own threshold for action.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="csF7F83909"&gt;&lt;SPAN class="csAEFEEABF"&gt;For more information, see the chapter "Predictive Modeling" in SAS Enterprise Miner Help.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="csAEFEEABF"&gt;Note: you might be able to apply this technique to a target variable that contains more than two levels. In that case, you need to specify how you want the levels to be weighted with respect to each other.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;/*** END USAGE NOTE 47965 EXCERPT ***/&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;You might also consider reviewing the paper Identifying and Overcoming Common Data Mining Mistake which is available at &lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp; &amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;A class="csB4F56505" href="http://www2.sas.com/proceedings/forum2007/073-2007.pdf" target="new"&gt;&lt;SPAN class="cs777ED152"&gt;http://www2.sas.com/proceedings/forum2007/073-2007.pdf&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;where it has a discussion of handling target variable event levels occurring in different proportions on the bottom of page 6. &amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;I hope this helps!&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&lt;SPAN class="cs1B16EEB5"&gt;Doug&lt;/SPAN&gt;&lt;/P&gt;
&lt;P class="cs95E872D0"&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Aug 2017 19:30:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Use-of-oversampling-and-cut-off-node-results-interpretation-in/m-p/388272#M5829</guid>
      <dc:creator>DougWielenga</dc:creator>
      <dc:date>2017-08-15T19:30:42Z</dc:date>
    </item>
  </channel>
</rss>

