<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: LSTM with dltrain in SAS Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/765317#M8859</link>
    <description>&lt;P&gt;Thank you very much for your answers. You have helped me a lot.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am not sure if I understood the second point correctly. Does it mean that you can choose between objective function OR loss as validation score?&lt;BR /&gt;How to understand the expression in the brackets (error + regularization)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;Torben&lt;/P&gt;</description>
    <pubDate>Wed, 01 Sep 2021 12:08:50 GMT</pubDate>
    <dc:creator>Torben2</dc:creator>
    <dc:date>2021-09-01T12:08:50Z</dc:date>
    <item>
      <title>LSTM with dltrain</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/763468#M8855</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have two questions about dttrain. I am using dltrain to predict time series using LSTM.&lt;/P&gt;&lt;P&gt;There are two options for which I would like to have more information:&lt;BR /&gt;- stagnation&lt;BR /&gt;- nthreads&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Stagnation:&lt;BR /&gt;SAS help says the following: "&lt;EM&gt;specifies the number of iterations completed without improvement before stopping the optimization early. When the validTable parameter is specified, the validation scores are monitored for stagnation.&lt;/EM&gt;"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Does iterations mean epochs? If not, what are iterations?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What exactly is meant by validation scores? Loss or error or both together?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tried a few settings, including stagnation = 1. However, even with an increase in validation errors, the training (including validation table) did not stop before the end of the specified maxEpochs.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;nthreads:&lt;BR /&gt;Can the runtime of the training be reduced by specifying threads? What is a reasonable number of threads and how can I determine it for my system?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks a lot!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many greetings&lt;BR /&gt;Torben&lt;/P&gt;</description>
      <pubDate>Tue, 24 Aug 2021 10:05:34 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/763468#M8855</guid>
      <dc:creator>Torben2</dc:creator>
      <dc:date>2021-08-24T10:05:34Z</dc:date>
    </item>
    <item>
      <title>Re: LSTM with dltrain</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/764452#M8857</link>
      <description>&lt;P&gt;1. epochs are group of iterations, representing when all the data has been used in updating weight.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;in stochastic gradient descent, each iteration we calculate derivatives and update weights, using only a small sample of all data. This way, when we have huge data, we can update quick. (For more information, google stochastic gradient descent.)&lt;/P&gt;
&lt;P&gt;If we have 500 data, each iteration we sample 50, then it take 10 iterations for all data to be used once.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;this 10 iterations, is an epoch.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. validation score: the objective function, or loss. (error + regularization)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3. nthreads: is how many GPU devices you use for calculation, not related to the algorithm. SAS Viya use parallel computing, so its like how many computers you want to do the calculation. Larger number=&amp;gt; faster. But you can't have it larger than the available GPU your IT gives you.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;4. stagnation: your understanding is correct. I suspect is keep going because of the objective function value is still going down, though the error stopped going down.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Aug 2021 13:02:32 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/764452#M8857</guid>
      <dc:creator>zhongxiuliu</dc:creator>
      <dc:date>2021-08-27T13:02:32Z</dc:date>
    </item>
    <item>
      <title>Re: LSTM with dltrain</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/765317#M8859</link>
      <description>&lt;P&gt;Thank you very much for your answers. You have helped me a lot.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I am not sure if I understood the second point correctly. Does it mean that you can choose between objective function OR loss as validation score?&lt;BR /&gt;How to understand the expression in the brackets (error + regularization)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;Torben&lt;/P&gt;</description>
      <pubDate>Wed, 01 Sep 2021 12:08:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/765317#M8859</guid>
      <dc:creator>Torben2</dc:creator>
      <dc:date>2021-09-01T12:08:50Z</dc:date>
    </item>
    <item>
      <title>Re: LSTM with dltrain</title>
      <link>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/765342#M8860</link>
      <description>&lt;P&gt;I meant objective function and loss function are often used to describe the same thing &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; However, they are different from error function.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Both function are error function (the error) + regularization (e.g., the squared or absolute value of weights; some people call it R1, R2; some people call it Lasso Ridge).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The reason behind this is: if we just minimize the error, we can easily get a model with very big weights, which makes our activation function's slope really deep (a little change in x, causes big change in y); our model would overfit, unstable and sensitive to noise .&lt;/P&gt;
&lt;P&gt;Minimizing both error and the weights, makes our neural network less sensitive to noises in data, and more generalizable.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Sep 2021 13:14:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Data-Science/LSTM-with-dltrain/m-p/765342#M8860</guid>
      <dc:creator>zhongxiuliu</dc:creator>
      <dc:date>2021-09-01T13:14:33Z</dc:date>
    </item>
  </channel>
</rss>

