<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Adjusting Predicted probabilities in SAS Base in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146893#M1454</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks @stat@sas !&lt;BR /&gt;An awesome read... I stopped doing what I was doing just to read this even if I don't use base SAS (I use EM). Still, very nice article... Thanks!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Tue, 06 May 2014 15:10:08 GMT</pubDate>
    <dc:creator>M_Maldonado</dc:creator>
    <dc:date>2014-05-06T15:10:08Z</dc:date>
    <item>
      <title>Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146888#M1449</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have the following situation, I have created a training and validation sample in which both the analogy of events/nonevents is the same thus 50-50.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have fitted the training sample and tested my model&amp;nbsp; on the validation sample (using proc logistic) with good results and i want to score now the initial population as a final test from which i made the two samples.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The initial population consists of 80% nonevents (0) and 20% events (1), if am not mistaken this means that i will have to adjust my predicted probabilities on the population to the true event rate since my model was build on a 50-50 analogy is that correct?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;How could i do that in SAS Base? I mean which output from the logistic regression must I investigate and apply this alteration? Any ideas? If i could know how the predicted probability in the first case is calculated maybe i could adjust within that calculation.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you in advance&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 13:21:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146888#M1449</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T13:21:27Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146889#M1450</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I think you oversampled the population to make event/non-event in 50-50 proportions in your training/validation samples. Check offset option in logistic regression to adjust predicted probabilities.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 13:43:22 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146889#M1450</guid>
      <dc:creator>stat_sas</dc:creator>
      <dc:date>2014-05-06T13:43:22Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146890#M1451</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Exactly that's what I did, would you happen to have any example on the offset option syntax wise so i could understand it better?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 13:59:02 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146890#M1451</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T13:59:02Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146891#M1452</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;See below&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://support.sas.com/kb/22/601.html" title="http://support.sas.com/kb/22/601.html"&gt;22601 - How do I adjust for oversampling the event level in a binary logistic model?&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 14:08:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146891#M1452</guid>
      <dc:creator>stat_sas</dc:creator>
      <dc:date>2014-05-06T14:08:51Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146892#M1453</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thnx&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 14:39:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146892#M1453</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T14:39:34Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146893#M1454</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thanks @stat@sas !&lt;BR /&gt;An awesome read... I stopped doing what I was doing just to read this even if I don't use base SAS (I use EM). Still, very nice article... Thanks!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 15:10:08 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146893#M1454</guid>
      <dc:creator>M_Maldonado</dc:creator>
      <dc:date>2014-05-06T15:10:08Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146894#M1455</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;One more question though, by using either prior or offset will i be adjusting the probabilities in the population data that i want to score or the already predicted probabilities in the training and validation set?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 15:33:13 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146894#M1455</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T15:33:13Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146895#M1456</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Offset just aligns intercept it does not effect beta coefficients. Use offset in proc logistic using your training data set and get the parameter estimates. Score validation dataset using estimates obtained through training dataset. If you see results are almost similar then you can use this model for population dataset as well.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 16:40:38 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146895#M1456</guid>
      <dc:creator>stat_sas</dc:creator>
      <dc:date>2014-05-06T16:40:38Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146896#M1457</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Ok , I was just under the impression that since the intercept gets "inflated" that would mean that the predicted probabilities would get inflated too ,thus they would need adjustment too, am I wrong?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 16:46:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146896#M1457</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T16:46:50Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146897#M1458</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Aha ok i misread since i am adjusting intercept I am good to go, so for scoring the validation which is also of a 50-50 analogy i dont need to adjust, that would be only for the population scoring where the true rates of events and non events is depicted&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 16:49:29 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146897#M1458</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T16:49:29Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146898#M1459</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;That is correct. If the objective is to see significance of predictors only then you don't need to use offset. &lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 16:54:55 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146898#M1459</guid>
      <dc:creator>stat_sas</dc:creator>
      <dc:date>2014-05-06T16:54:55Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146899#M1460</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Thank you for your clarifications&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 06 May 2014 19:21:56 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146899#M1460</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-06T19:21:56Z</dc:date>
    </item>
    <item>
      <title>Re: Adjusting Predicted probabilities in SAS Base</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146900#M1461</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;I have tried the following :&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc freq data=OUTPUT.POPULATION (THE SET WITH 95% nonevents and 5% events) noprint;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; table TARGET / out=priors(drop=percent rename=(count=_prior_));&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;and i try to score it with the parameters based on the training set:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;PROC LOGISTIC INMODEL=MODELPARAMETERSTRAINING (the 50-50 sample)&amp;nbsp; ;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; SCORE DATA=OUTPUT.POPULATION PRIOR=PRIORS&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; OUTROC=ROC&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; OUT=SCORESPOP PREDICTED=ESTPROB&lt;/P&gt;&lt;P&gt;;&lt;/P&gt;&lt;P&gt;RUN;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Would that be the correct syntax? I am wondering cause when i compare the estimated probabilities of scoring with or without the priors i see a huge difference in p_1&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 07 May 2014 07:35:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/Adjusting-Predicted-probabilities-in-SAS-Base/m-p/146900#M1461</guid>
      <dc:creator>chemicalab</dc:creator>
      <dc:date>2014-05-07T07:35:15Z</dc:date>
    </item>
  </channel>
</rss>

