<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What is causing a scored dataset from a neural network to have missing values for every probabil in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942400#M10917</link>
    <description>&lt;P&gt;I did notice that my gradient boosting model, which uses astore to make predictions, does have the _WARN_ variable. However, in the scored data produced by this model, only a few rows have an entry (an M) in the _WARN_ variable, and those rows still have predicted probabilities (rather than missing values). This is unlike the scored data from the neural network, which makes use of the CODE function: that data has no _WARN_ variable, and every predicted probability is missing.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Sep 2024 23:03:48 GMT</pubDate>
    <dc:creator>William29</dc:creator>
    <dc:date>2024-09-03T23:03:48Z</dc:date>
    <item>
      <title>What is causing a scored dataset from a neural network to have missing values for every probability?</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942150#M10913</link>
      <description>&lt;P&gt;I trained a new neural network on a dataset using the following code:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_0-1725256823412.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99863i8E223EEED7FC7FC1/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_0-1725256823412.png" alt="William29_0-1725256823412.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The scored test dataset looks fine, with no missing probability values, as can be seen below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_1-1725257024779.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99864i99602DDB04362647/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_1-1725257024779.png" alt="William29_1-1725257024779.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I then decided, in a separate SAS file, to score a new dataset using the scoring code that was saved when running the model (see the code option near the bottom of the first screenshot).&lt;/P&gt;&lt;P&gt;The code that scores the new dataset is below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_2-1725257366731.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99865i1CF943FBE01D335B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_2-1725257366731.png" alt="William29_2-1725257366731.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;However, in the output data, all of the probabilities (of the predicted categories of the target variable) are missing for every observation 
(see the data below):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_3-1725257864125.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99866i60D370D5BAF387E8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_3-1725257864125.png" alt="William29_3-1725257864125.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;It is like that for every variable.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is despite the fact that the new data that was scored actually comes from the original dataset used to train and test the model. I have checked whether this is due to missing values in the predictor columns, but for that to explain the problem there would have to be a missing predictor value in every row, yet nearly every row is complete.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have attached the log for the code, which includes the part that scores the data.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Essentially, I am wondering what is causing these missing values and how to fix the problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also note that I trained a gradient boosting model on the same data using the code below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_4-1725258471725.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99867i7A7DB6F2829200A9/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_4-1725258471725.png" alt="William29_4-1725258471725.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;I also used the following to save the model:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_6-1725258754949.png" style="width: 400px;"&gt;&lt;img 
src="https://communities.sas.com/t5/image/serverpage/image-id/99869iFF2033C4E50DE592/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_6-1725258754949.png" alt="William29_6-1725258754949.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I then used astore to score new data with that gradient boosting model. This was the same new data that produced the missing value problem when scored by the neural network model. I scored it with the gradient boosting model in the same file that was used for the neural network scoring, and the code used is below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_5-1725258688644.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99868i40DBE07A578F7247/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_5-1725258688644.png" alt="William29_5-1725258688644.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There were no missing values in the probabilities in the scored data in this case. 
An example of this scored data with no missing values is below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="William29_7-1725258989195.png" style="width: 400px;"&gt;&lt;img src="https://communities.sas.com/t5/image/serverpage/image-id/99870iF26CCF5827A48E1F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="William29_7-1725258989195.png" alt="William29_7-1725258989195.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, I am wondering what could be causing this issue of missing probabilities with the neural network model (and how I can fix it), given that the dataset fed into the score code is exactly the same as the data fed into the astore procedure for the gradient boosting model.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Sep 2024 06:37:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942150#M10913</guid>
      <dc:creator>William29</dc:creator>
      <dc:date>2024-09-02T06:37:10Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942179#M10914</link>
      <description>&lt;P&gt;Don't you have a&amp;nbsp;&lt;SPAN&gt;variable named &lt;STRONG&gt;_WARN_&lt;/STRONG&gt; in the scored data set that indicates why the model could not be applied?&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;If I have time later today, I will analyze the log-file.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Moving your question to the "SAS Data Science" board.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Koen&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 02 Sep 2024 09:14:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942179#M10914</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2024-09-02T09:14:37Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942241#M10915</link>
      <description>&lt;P&gt;I have looked and I definitely do not have a variable called _WARN_ in the scored dataset. Should this variable be present?&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 04:59:27 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942241#M10915</guid>
      <dc:creator>William29</dc:creator>
      <dc:date>2024-09-03T04:59:27Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942261#M10916</link>
      <description>&lt;BLOCKQUOTE&gt;&lt;HR /&gt;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/445770"&gt;@William29&lt;/a&gt;&amp;nbsp;wrote:&lt;BR /&gt;
&lt;P&gt;I have looked and I definitely do not have a variable called _WARN_ in the scored dataset. Should this variable be present?&lt;/P&gt;
&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Not necessarily.&lt;BR /&gt;I know that Enterprise Miner (a SAS 9.x tool) included this variable when scoring (new) data with a deployed model.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 10:15:01 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942261#M10916</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2024-09-03T10:15:01Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942400#M10917</link>
      <description>&lt;P&gt;I did notice that my gradient boosting model, which uses astore to make predictions, does have the _WARN_ variable. However, in the scored data produced by this model, only a few rows have an entry (an M) in the _WARN_ variable, and those rows still have predicted probabilities (rather than missing values). This is unlike the scored data from the neural network, which makes use of the CODE function: that data has no _WARN_ variable, and every predicted probability is missing.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Sep 2024 23:03:48 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942400#M10917</guid>
      <dc:creator>William29</dc:creator>
      <dc:date>2024-09-03T23:03:48Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942846#M10919</link>
      <description>I managed to solve this issue by looking at the score code file itself and finding the conditions that lead to what it calls a "bad val", whereupon it sets everything to missing. The cause was that the categorical predictors in the new data contained categories that were not present in the training data.</description>
      <pubDate>Fri, 06 Sep 2024 04:17:39 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/942846#M10919</guid>
      <dc:creator>William29</dc:creator>
      <dc:date>2024-09-06T04:17:39Z</dc:date>
    </item>
    <item>
      <title>Re: What is causing a scored dataset from a neural network to have missing values for every probabil</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/943143#M10921</link>
      <description>&lt;P&gt;Thanks for the feedback.&lt;BR /&gt;You can label your own answer above as the solution.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Mon, 09 Sep 2024 13:48:10 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/What-is-causing-a-scored-dataset-from-a-neural-network-to-have/m-p/943143#M10921</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2024-09-09T13:48:10Z</dc:date>
    </item>
  </channel>
</rss>

