<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Use of ASE to assess model complexity in SAS Academy for Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/647965#M784</link>
    <description>&lt;P&gt;Just a further clarification on the statement "&lt;SPAN&gt;because adjusting for prior probability basically only shifting the intercept values&lt;/SPAN&gt;": that is true for linear models (i.e. Logistic Regression); but what about non-parametric or non-linear models such as Decision Trees and Neural Networks? Would that still just result in a shift of the intercept values?&lt;/P&gt;</description>
    <pubDate>Fri, 15 May 2020 05:10:42 GMT</pubDate>
    <dc:creator>pvareschi</dc:creator>
    <dc:date>2020-05-15T05:10:42Z</dc:date>
    <item>
      <title>Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/644752#M711</link>
      <description>&lt;P&gt;Perhaps this is well known to everybody, but it came as a surprise to me: after some tests fitting models with and without prior probabilities defined, I noticed that Enterprise Miner does not take prior probabilities into account when calculating Average Squared Error (ASE). The same applies to the residuals saved in output data sets.&lt;/P&gt;&lt;P&gt;That being the case, I would like to clarify whether there is any scenario in which we would end up choosing a different model (out of a sequence of models of increasing complexity, e.g. Regression) if ASE were adjusted for prior probabilities.&lt;/P&gt;&lt;P&gt;My instinct tells me that is not the case, but I wonder whether there is a more mathematical justification for it.&lt;/P&gt;</description>
      <pubDate>Sat, 02 May 2020 18:05:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/644752#M711</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-02T18:05:11Z</dc:date>
    </item>
    <item>
      <title>Re: Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/644942#M717</link>
      <description>&lt;P&gt;Just to further clarify, I am referring to "Applied Analytics Using SAS Enterprise Miner", "Lesson 7: Model Assessment Using SAS Enterprise Miner", "Adjusting for Separate Sampling": if we do not specify prior probabilities, we know that performance metrics are inaccurate and/or biased; however, what I am concerned about is whether this would affect the choice of the "best model", especially when applied to a single modelling node to assess model complexity. My understanding is that it would not, at least when using ASE or misclassification rate (Profit/Loss would be a different matter).&lt;/P&gt;</description>
      <pubDate>Mon, 04 May 2020 09:16:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/644942#M717</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-04T09:16:13Z</dc:date>
    </item>
    <item>
      <title>Re: Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/645696#M731</link>
      <description>&lt;P&gt;I agree with your comments, because adjusting for prior probabilities basically only shifts the intercept values. Therefore, this should not affect the model selection. However, if you want the prior values to affect your model decision, you should consider the decision option and provide decision weights (please refer to Chapter 6 in the AAEM course notes).&lt;/P&gt;</description>
      <pubDate>Wed, 06 May 2020 18:44:00 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/645696#M731</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-05-06T18:44:00Z</dc:date>
    </item>
    <item>
      <title>Re: Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/647965#M784</link>
      <description>&lt;P&gt;Just a further clarification on the statement "&lt;SPAN&gt;because adjusting for prior probability basically only shifting the intercept values&lt;/SPAN&gt;": that is true for linear models (i.e. Logistic Regression); but what about non-parametric or non-linear models such as Decision Trees and Neural Networks? Would that still just result in a shift of the intercept values?&lt;/P&gt;</description>
      <pubDate>Fri, 15 May 2020 05:10:42 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/647965#M784</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-15T05:10:42Z</dc:date>
    </item>
    <item>
      <title>Re: Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/648324#M789</link>
      <description>&lt;P&gt;Your question:&lt;/P&gt;
&lt;P&gt;Just a further clarification on the statement "&lt;SPAN&gt;because adjusting for prior probability basically only shifting the intercept values&lt;/SPAN&gt;": that is true for linear models (i.e. Logistic Regression); but what about non-parametric or non-linear models such as Decision Trees and Neural Networks? Would that still just result in a shift of the intercept values?&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;My answer:&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;When the target variable is binary, the predictive model is a classification model, and the goal of a decision tree, logistic regression, or neural network (NN) is to classify the binary target correctly. To assess such a model, all possible pairs of one event and one non-event are formed. A pair is concordant if the model assigns the higher predicted probability to the event, and discordant otherwise. By random chance alone, the event would be picked correctly in 50% of the pairs; we hope that the model we develop does better than that at differentiating events from non-events. These statistics (% concordant and % discordant) are the basis of the ROC index. The ROC index is not influenced by the prior probabilities, which is why it is a popular model comparison statistic. Also, by default, the proportion of events to non-events in the population is not considered when developing classification models.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;
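&lt;P&gt;To make the pair counting concrete, here is a minimal Python sketch. It is not SAS code and is not meant to show how Enterprise Miner computes the statistic internally. The scores and labels are made-up values, and roc_index is a hypothetical helper name. The rescaling step stands in for a prior adjustment: it changes every predicted probability but keeps their order, so the share of concordant pairs does not change.&lt;/P&gt;

```python
from itertools import product


def roc_index(scores, labels):
    """Share of (event, non-event) pairs in which the event has the higher score.

    Tied pairs count as half a concordant pair.
    """
    events = [s for s, y in zip(scores, labels) if y == 1]
    non_events = [s for s, y in zip(scores, labels) if y == 0]
    concordant = 0.0
    for e, n in product(events, non_events):
        if e > n:
            concordant += 1.0
        elif e == n:
            concordant += 0.5
    return concordant / (len(events) * len(non_events))


# Hypothetical predicted probabilities and true labels (1 = event).
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3]
labels = [1, 1, 0, 1, 0, 0]

# A strictly increasing rescaling (the odds are divided by 9),
# used here as a stand-in for a prior probability adjustment.
rescaled = [p / (9.0 - 8.0 * p) for p in scores]

print(roc_index(scores, labels))
print(roc_index(rescaled, labels))  # same value: the ranking of the pairs is unchanged
```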
&lt;P&gt;&lt;STRONG&gt;&lt;FONT color="#339966"&gt;However, if the goal of scoring is to compute posterior probabilities, then the posterior probabilities need to be adjusted for the prior probabilities after the model is developed. This adjustment is the same whether we use a decision tree (baseline adjustment), logistic regression, or NN (intercept, offset, or bias), because the non-linear component of the model is not involved in the prior probability adjustment.&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/P&gt;
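&lt;P&gt;As an illustration, the sketch below uses one common form of the prior adjustment for a binary target. Whether Enterprise Miner applies exactly this formula is an assumption on my part; the function name adjust_posterior, the 50/50 sample and the 5% population event rate are hypothetical. The point is that the adjustment only needs the posterior probability itself, so it works the same way for any model, and on the logit scale it amounts to adding one constant.&lt;/P&gt;

```python
import math


def adjust_posterior(p, sample_rate, prior):
    """Rescale a posterior fitted on a sample whose event share is `sample_rate`
    to a population whose event share is `prior` (binary target)."""
    event = p * prior / sample_rate
    non_event = (1.0 - p) * (1.0 - prior) / (1.0 - sample_rate)
    return event / (event + non_event)


def logit(p):
    return math.log(p / (1.0 - p))


# Hypothetical posteriors from any classifier (tree, regression, NN)
# trained on a 50/50 oversample; assumed population event rate of 5%.
posteriors = [0.2, 0.5, 0.9]
adjusted = [adjust_posterior(p, sample_rate=0.5, prior=0.05) for p in posteriors]

# The change on the logit scale is the same for every case
# (up to floating-point rounding), i.e. a pure intercept-style shift.
shifts = [logit(a) - logit(p) for a, p in zip(adjusted, posteriors)]
print(adjusted)
print(shifts)
```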
&lt;P&gt;&lt;FONT color="#0000FF"&gt;I hope this explanation is adequate.&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 16 May 2020 17:37:59 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/648324#M789</guid>
      <dc:creator>gcjfernandez</dc:creator>
      <dc:date>2020-05-16T17:37:59Z</dc:date>
    </item>
    <item>
      <title>Re: Use of ASE to assess model complexity</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/649613#M807</link>
      <description>&lt;P&gt;Thank you for your explanation; very thorough!&lt;/P&gt;</description>
      <pubDate>Thu, 21 May 2020 15:38:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Use-of-ASE-to-assess-model-complexity/m-p/649613#M807</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-21T15:38:43Z</dc:date>
    </item>
  </channel>
</rss>

