<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SAS EM predicted probabilities in score data set are all the same in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-predicted-probabilities-in-score-data-set-are-all-the/m-p/238371#M3496</link>
    <description>&lt;P&gt;So, you have built a linear regression model. I take it that the dataset shown in your third screenshot contains both the independent variables in your model and the prediction for the target variable LOG_TargetD. (I don't see any "probabilities" in it, though.) The prediction is a linear function of the independent variables ("features"). Hence, if two individuals have the same values for each of the features, their predicted target values are necessarily equal, too.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If all individuals have identical &lt;SPAN&gt;values for each of the &lt;EM&gt;features&lt;/EM&gt;, then you should be wondering why &lt;EM&gt;this&lt;/EM&gt; is the case. The equality of predictions would be a mere consequence in this situation.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Your third screenshot shows identical values in each column. Is this the general pattern for the whole dataset?&lt;/SPAN&gt;&lt;/P&gt;
</description>
    <pubDate>Tue, 08 Dec 2015 19:39:46 GMT</pubDate>
    <dc:creator>FreelanceReinh</dc:creator>
    <dc:date>2015-12-08T19:39:46Z</dc:date>
    <item>
      <title>SAS EM predicted probabilities in score data set are all the same</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-predicted-probabilities-in-score-data-set-are-all-the/m-p/238332#M3495</link>
      <description>&lt;P&gt;I've built a model in SAS EM to predict an interval target based on different features. I imputed some of them, smoothed others, and binned or transformed the rest, just basic data preparation before building the model. I then partitioned the data into training (65%) and validation (35%) sets and built a linear regression to predict the target. It has a validation error rate of 13%, which is fine.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, the problem kicks in when I import a score data set (it is actually the same dataset I prepared above, with the target variable deleted). Every prediction for the target has the same value. I can't understand the reason behind this. My model was fine while I was building it, and the score data set is just the prepared dataset I used to train and validate. What is wrong?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is my original dataset:&lt;/P&gt;&lt;P&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/1088i8C3BBFF1163A2A8C/image-size/original?v=mpbl-1&amp;amp;px=-1" border="0" alt="Capture.PNG" title="Capture.PNG" /&gt;&lt;/P&gt;&lt;P&gt;These are my model comparison results:&lt;/P&gt;&lt;P&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/1089iABC17A69A55D29F8/image-size/original?v=mpbl-1&amp;amp;px=-1" border="0" alt="Capture3.PNG" title="Capture3.PNG" /&gt;&lt;/P&gt;&lt;P&gt;And this is my score data set:&lt;/P&gt;&lt;P&gt;&lt;IMG src="https://communities.sas.com/t5/image/serverpage/image-id/1090i821C21D2C09DE595/image-size/original?v=mpbl-1&amp;amp;px=-1" border="0" alt="Capture.PNG" title="Capture.PNG" /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 08 Dec 2015 16:47:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-predicted-probabilities-in-score-data-set-are-all-the/m-p/238332#M3495</guid>
      <dc:creator>Xiaojun</dc:creator>
      <dc:date>2015-12-08T16:47:06Z</dc:date>
    </item>
    <item>
      <title>Re: SAS EM predicted probabilities in score data set are all the same</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-predicted-probabilities-in-score-data-set-are-all-the/m-p/238371#M3496</link>
      <description>&lt;P&gt;So, you have built a linear regression model. I take it that the dataset shown in your third screenshot contains both the independent variables in your model and the prediction for the target variable LOG_TargetD. (I don't see any "probabilities" in it, though.) The prediction is a linear function of the independent variables ("features"). Hence, if two individuals have the same values for each of the features, their predicted target values are necessarily equal, too.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If all individuals have identical &lt;SPAN&gt;values for each of the &lt;EM&gt;features&lt;/EM&gt;, then you should be wondering why &lt;EM&gt;this&lt;/EM&gt; is the case. The equality of predictions would be a mere consequence in this situation.&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;Your third screenshot shows identical values in each column. Is this the general pattern for the whole dataset?&lt;/SPAN&gt;&lt;/P&gt;
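&lt;P&gt;To make the point concrete, here is a minimal Python sketch (not SAS EM code). The intercept, the coefficients and the feature names "age" and "income" are made up for illustration and are not taken from your model. It shows that a linear prediction is constant whenever every feature is constant across rows, and it includes a small helper for listing the features that never vary.&lt;/P&gt;

```python
# Hypothetical intercept, coefficients and feature names, for illustration only.
INTERCEPT = 2.0
COEFFS = {"age": 0.5, "income": 0.25}


def predict(row):
    """Linear prediction: intercept plus the weighted sum of the features."""
    return INTERCEPT + sum(COEFFS[name] * row[name] for name in COEFFS)


def constant_columns(rows):
    """Return the sorted names of features that take a single value in all rows."""
    return sorted(name for name in rows[0] if len({row[name] for row in rows}) == 1)


# Two rows with identical feature values, and two rows that differ.
identical_rows = [{"age": 40, "income": 8}, {"age": 40, "income": 8}]
varied_rows = [{"age": 40, "income": 8}, {"age": 30, "income": 12}]

identical_preds = [predict(row) for row in identical_rows]
varied_preds = [predict(row) for row in varied_rows]

print(identical_preds, constant_columns(identical_rows))
print(varied_preds, constant_columns(varied_rows))
```

&lt;P&gt;If a check like constant_columns reports every model input for your score data set, the identical predictions are only a symptom, and the data preparation or import step is the place to look.&lt;/P&gt;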
</description>
      <pubDate>Tue, 08 Dec 2015 19:39:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/SAS-EM-predicted-probabilities-in-score-data-set-are-all-the/m-p/238371#M3496</guid>
      <dc:creator>FreelanceReinh</dc:creator>
      <dc:date>2015-12-08T19:39:46Z</dc:date>
    </item>
  </channel>
</rss>

