<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Validation of Statistical Model Failed in SAS Programming</title>
    <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609919#M177583</link>
    <description>&lt;P&gt;Sorry.&lt;/P&gt;
&lt;P&gt;For the second way, I meant NOT using the selection= option.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;proc logistic......&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;model .....;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;and check output:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Analysis of Maximum Likelihood Estimates&lt;/P&gt;
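&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;Parameter&lt;/TH&gt;&lt;TH&gt;DF&lt;/TH&gt;&lt;TH&gt;Estimate&lt;/TH&gt;&lt;TH&gt;Standard Error&lt;/TH&gt;&lt;TH&gt;Wald Chi-Square&lt;/TH&gt;&lt;TH&gt;&lt;STRONG&gt;Pr &amp;gt; ChiSq&lt;/STRONG&gt;&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Intercept&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;-6.4769&lt;/TD&gt;&lt;TD&gt;6.8880&lt;/TD&gt;&lt;TD&gt;0.8842&lt;/TD&gt;&lt;TD&gt;0.3471&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Age&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;1.6308&lt;/TD&gt;&lt;TD&gt;0.9349&lt;/TD&gt;&lt;TD&gt;3.0425&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;0.0811&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Weight&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;-0.1569&lt;/TD&gt;&lt;TD&gt;0.0810&lt;/TD&gt;&lt;TD&gt;3.7507&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;0.0528&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;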
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Then manually drop every variable whose p-value (Pr &amp;gt; ChiSq) is greater than 0.05.&lt;/STRONG&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 06 Dec 2019 11:57:37 GMT</pubDate>
    <dc:creator>Ksharp</dc:creator>
    <dc:date>2019-12-06T11:57:37Z</dc:date>
    <item>
      <title>Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609906#M177577</link>
      <description>&lt;P&gt;I've got a statistical model with 27 variables coming in. It was built on a Train dataset and tested on a Test dataset, using a KS test and PSI. It fails the KS test, which I think shows that it's overfitted.&lt;/P&gt;
&lt;P&gt;I'm just not sure what I should do next.&lt;/P&gt;
&lt;P&gt;Should I start taking some of the variables out of the model? If so, how do I decide which ones? Do I let other variables come in instead of the ones I'm taking out?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 10:27:53 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609906#M177577</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-06T10:27:53Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609913#M177580</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;Should I start taking some of the variables out of the model?&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;Yes. 27 variables are way too many for a scorecard.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Generally, a scorecard contains 8-15 variables.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;There are several ways to drop insignificant variables:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;1)&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;proc logistic......&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;model ...../selection=stepwise&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;........&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
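&lt;P&gt;A minimal sketch of way 1, assuming a training table work.train, a 0/1 target bad_flag and banded predictors band_var1 to band_var3 (all hypothetical names). The 0.05 entry and stay levels are only an illustrative choice, not a recommendation:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Hypothetical data set and variable names, for illustration only */
proc logistic data=work.train;
   class band_var1 band_var2 band_var3 / param=ref;
   /* slentry= and slstay= set the significance levels to enter and to stay */
   model bad_flag(event='1') = band_var1 band_var2 band_var3
         / selection=stepwise slentry=0.05 slstay=0.05;
run;&lt;/CODE&gt;&lt;/PRE&gt;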
&lt;P&gt;2) Drop the variables whose p-value is &amp;gt; 0.05 in the parameter estimates table.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3) Use proc hpgenselect.&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 11:25:41 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609913#M177580</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-12-06T11:25:41Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609917#M177582</link>
      <description>&lt;P&gt;Thanks for this!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm building the model using Proc Logistic. In the output table "Summary of Stepwise Selection" there weren't any variables that had a Pr &amp;gt; Chi-Square value &amp;gt; 0.05. Is this the p-value?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have dropped a couple of the variables which has left me with 22 variables and the validation is slightly better but not fully validated.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I've not used&amp;nbsp;&lt;SPAN&gt;proc hpgenselect&amp;nbsp; before so I'll read up on that and how to use it to help my model build.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 11:50:25 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609917#M177582</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-06T11:50:25Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609919#M177583</link>
      <description>&lt;P&gt;Sorry.&lt;/P&gt;
&lt;P&gt;For the second way, I meant NOT using the selection= option.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;proc logistic......&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;model .....;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;run;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;and check output:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Analysis of Maximum Likelihood Estimates&lt;/P&gt;
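&lt;TABLE&gt;
&lt;TR&gt;&lt;TH&gt;Parameter&lt;/TH&gt;&lt;TH&gt;DF&lt;/TH&gt;&lt;TH&gt;Estimate&lt;/TH&gt;&lt;TH&gt;Standard Error&lt;/TH&gt;&lt;TH&gt;Wald Chi-Square&lt;/TH&gt;&lt;TH&gt;&lt;STRONG&gt;Pr &amp;gt; ChiSq&lt;/STRONG&gt;&lt;/TH&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Intercept&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;-6.4769&lt;/TD&gt;&lt;TD&gt;6.8880&lt;/TD&gt;&lt;TD&gt;0.8842&lt;/TD&gt;&lt;TD&gt;0.3471&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Age&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;1.6308&lt;/TD&gt;&lt;TD&gt;0.9349&lt;/TD&gt;&lt;TD&gt;3.0425&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;0.0811&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;
&lt;TR&gt;&lt;TD&gt;Weight&lt;/TD&gt;&lt;TD&gt;1&lt;/TD&gt;&lt;TD&gt;-0.1569&lt;/TD&gt;&lt;TD&gt;0.0810&lt;/TD&gt;&lt;TD&gt;3.7507&lt;/TD&gt;&lt;TD&gt;&lt;STRONG&gt;0.0528&lt;/STRONG&gt;&lt;/TD&gt;&lt;/TR&gt;
&lt;/TABLE&gt;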
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
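&lt;P&gt;To list the candidates instead of reading the table by eye, one possible sketch captures the table with ODS OUTPUT and filters on the ProbChiSq column. The names work.train, bad_flag, age, weight and work.pe are hypothetical:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Hypothetical data set and variable names, for illustration only */
ods output ParameterEstimates=work.pe;
proc logistic data=work.train;
   model bad_flag(event='1') = age weight;
run;

/* Rows whose Wald p-value exceeds 0.05, leaving out the intercept */
proc print data=work.pe;
   where Variable ne 'Intercept' and ProbChiSq gt 0.05;
run;&lt;/CODE&gt;&lt;/PRE&gt;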
&lt;P&gt;&lt;STRONG&gt;Then manually drop every variable whose p-value (Pr &amp;gt; ChiSq) is greater than 0.05.&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 11:57:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609919#M177583</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-12-06T11:57:37Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609934#M177588</link>
      <description>&lt;P&gt;Is this the method I should use if I'm modelling categorical variables too?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I will have maybe 4 different "bands" coming in for the class variables, so do I null only the band that has P &amp;gt; 0.05, or do I null the whole variable if one band has that value?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for your help with this, it's much appreciated.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 12:55:49 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/609934#M177588</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-06T12:55:49Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610037#M177625</link>
      <description>&lt;P&gt;1. How did you decide which variables to leave in your model? What model fitting approach did you use?&lt;/P&gt;
&lt;P&gt;2. Is your model designed to be explanatory or predictive?&lt;/P&gt;
&lt;P&gt;3. How many observations do you have? If you fit a model with 27 variables and 500 observations it will never work.&lt;/P&gt;
&lt;P&gt;4. For your categorical variables, how many levels do you have? A categorical variable with four levels actually counts as 3 variables, since it requires 3 design variables.&lt;/P&gt;
&lt;P&gt;5. What did you set as the parameterization method for your categorical variables?&lt;/P&gt;
&lt;P&gt;6. For the categorical variables, did you analyze them ahead of time to see if the levels make sense, or if they could be combined?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/275242"&gt;@manonlyn&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;Thanks for this!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I'm building the model using Proc Logistic. In the output table "Summary of Stepwise Selection" there weren't any variables that had a Pr &amp;gt; Chi-Square value &amp;gt; 0.05. Is this the p-value?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have dropped a couple of the variables which has left me with 22 variables and the validation is slightly better but not fully validated.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I've not used&amp;nbsp;&lt;SPAN&gt;proc hpgenselect&amp;nbsp; before so I'll read up on that and how to use it to help my model build.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 17:09:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610037#M177625</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-12-06T17:09:26Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610184#M177682</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;SPAN&gt;do I null the whole variable if one band has that value?&lt;/SPAN&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Drop the whole variable, whether it is categorical or numeric (i.e. null the whole variable).&lt;/SPAN&gt;&lt;/P&gt;
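&lt;P&gt;For a CLASS variable, the per-band rows in the parameter estimates table compare each band with the reference level, while the "Type 3 Analysis of Effects" table that PROC LOGISTIC prints when CLASS variables are in the model tests the variable as a whole. A minimal sketch, with hypothetical names work.train, bad_flag, band_var1 and band_var2:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Hypothetical data set and variable names, for illustration only */
proc logistic data=work.train;
   class band_var1 band_var2 / param=ref;
   model bad_flag(event='1') = band_var1 band_var2;
run;
/* In the output, the Type 3 Analysis of Effects table has one row per */
/* CLASS variable, which is the row to read when deciding whether to   */
/* keep or drop the whole variable.                                    */&lt;/CODE&gt;&lt;/PRE&gt;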
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 07 Dec 2019 10:32:03 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610184#M177682</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-12-07T10:32:03Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610420#M177760</link>
      <description>&lt;P&gt;1. They would need to have a suitable Value of Information and make logical sense for the trend.&lt;/P&gt;
&lt;P&gt;2. Predictive.&lt;/P&gt;
&lt;P&gt;3. Approx 6000 observations&lt;/P&gt;
&lt;P&gt;4. The categorical variables have different levels; I am only modelling categorical variables.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;5. Not sure what this question means.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;6. Yes, I analyzed over 200 variables (categorical) to make sure the levels made logical sense and combined levels where needed.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for your interest in helping me with this.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 09 Dec 2019 09:04:07 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610420#M177760</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-09T09:04:07Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610516#M177796</link>
      <description>Re 5, what did you specify as the PARAM option on the CLASS statement? &lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_logistic_syntax05.htm&amp;amp;docsetVersion=15.1&amp;amp;locale=en" target="_blank"&gt;https://documentation.sas.com/?docsetId=statug&amp;amp;docsetTarget=statug_logistic_syntax05.htm&amp;amp;docsetVersion=15.1&amp;amp;locale=en&lt;/A&gt;</description>
      <pubDate>Mon, 09 Dec 2019 17:21:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610516#M177796</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-12-09T17:21:48Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610672#M177869</link>
      <description>&lt;P&gt;This is a statistics question. It would be better to post it in the Stat forum and call on&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/13633"&gt;@StatDave&lt;/a&gt;.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2019 11:46:21 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610672#M177869</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2019-12-10T11:46:21Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610684#M177876</link>
      <description>&lt;P&gt;I don't think I have anything under that option. I just put CLASS, then list the class variables, followed by (REF = 'NULL').&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
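&lt;P&gt;For illustration, a CLASS statement along those lines might look like the sketch below (work.train, bad_flag, band_var1 and band_var2 are hypothetical names). The option in parentheses is repeated after each variable here, on the assumption that it applies only to the variable it follows:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE class=" language-sas"&gt;/* Hypothetical data set and variable names, for illustration only */
proc logistic data=work.train;
   /* No PARAM= option, so PROC LOGISTIC uses its default effect coding. */
   /* Appending "/ param=ref" would switch to reference-cell coding.     */
   class band_var1 (ref='NULL') band_var2 (ref='NULL');
   model bad_flag(event='1') = band_var1 band_var2;
run;&lt;/CODE&gt;&lt;/PRE&gt;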
&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2019 12:09:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610684#M177876</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-10T12:09:09Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610685#M177877</link>
      <description>&lt;P&gt;I've got it to validate today! Thanks for all your help with this; it's really appreciated. If I have similar questions in the future I'll be sure to post them on the stat board.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2019 12:10:04 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610685#M177877</guid>
      <dc:creator>manonlyn</dc:creator>
      <dc:date>2019-12-10T12:10:04Z</dc:date>
    </item>
    <item>
      <title>Re: Validation of Statistical Model Failed</title>
      <link>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610760#M177917</link>
      <description>So you want all your categorical variables to be compared against the case where the value is NULL?&lt;BR /&gt;Usually you want PARAM=REF in your CLASS statement as well.</description>
      <pubDate>Tue, 10 Dec 2019 16:12:40 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Programming/Validation-of-Statistical-Model-Failed/m-p/610760#M177917</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2019-12-10T16:12:40Z</dc:date>
    </item>
  </channel>
</rss>

