<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic HPGENSELECT and LASSO in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/HPGENSELECT-and-LASSO/m-p/945051#M47228</link>
    <description>&lt;P&gt;I am doing LASSO using HPGENSELECT. Below is the code that I am using.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;PROC HPGENSELECT DATA=library.Bigdata LASSORHO= 0.80 LASSOSTEPS= 50;&lt;BR /&gt;PARTITION roleVar=Group(train='Group1' validate='Group2');&lt;BR /&gt;CLASSVAR1 VAR2;&lt;BR /&gt;MODEL MASLD (descending)= VAR1 VAR2 ...... VAR(X);&lt;BR /&gt;/ DISTRIBUTION = BINARY;&lt;BR /&gt;SELECTION METHOD=LASSO (CHOOSE=VALIDATE STOP=None) DETAILS=ALL;&lt;BR /&gt;by _imputation_;&lt;BR /&gt;ods output ParameterEstimates=data;&lt;BR /&gt;run;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;I am facing a couple of problems here.&amp;nbsp; LASSO is selecting the optimal model as one with the lowest ASE, which is always the last step(in case the 50th step) So, I need to increase the LASSOSTEPS=100, still it selects the 100th step as the optimal model It keeps on doing that how much ever I increase the LASSOSTEPS. And I increase the LASSOSTEPS, the number of variables selected into the model also increases and at around TEPS -80, the SAS selects all the 45 variables in to the optimal mode; Do, basically its not doing any more variable selection by the time&amp;nbsp; it reaches step 80, but still&amp;nbsp; last step (say 100th) is selected as the optimal model by LASSO.&amp;nbsp; Does anyone know why is it doing that? Also, the number of variable sthat SAS selects varies by the value of LAMBDARHO and LAMDASTEPS. Is there a way for me to what is the optimal values for these and how to get that for my model? This is my reference-&lt;A href="https://www.mwsug.org/proceedings/2017/AA/MWSUG-2017-AA02.pdf" target="_blank"&gt;https://www.mwsug.org/proceedings/2017/AA/MWSUG-2017-AA02.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also use SAS documentation for HPGENSELECT as my reference. Can anyone help with this? Thank you.&lt;/P&gt;</description>
    <pubDate>Tue, 24 Sep 2024 14:07:33 GMT</pubDate>
    <dc:creator>cHirap</dc:creator>
    <dc:date>2024-09-24T14:07:33Z</dc:date>
    <item>
      <title>HPGENSELECT and LASSO</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/HPGENSELECT-and-LASSO/m-p/945051#M47228</link>
      <description>&lt;P&gt;I am doing LASSO using HPGENSELECT. Below is the code that I am using.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;PROC HPGENSELECT DATA=library.Bigdata LASSORHO= 0.80 LASSOSTEPS= 50;&lt;BR /&gt;PARTITION roleVar=Group(train='Group1' validate='Group2');&lt;BR /&gt;CLASSVAR1 VAR2;&lt;BR /&gt;MODEL MASLD (descending)= VAR1 VAR2 ...... VAR(X);&lt;BR /&gt;/ DISTRIBUTION = BINARY;&lt;BR /&gt;SELECTION METHOD=LASSO (CHOOSE=VALIDATE STOP=None) DETAILS=ALL;&lt;BR /&gt;by _imputation_;&lt;BR /&gt;ods output ParameterEstimates=data;&lt;BR /&gt;run;&lt;BR /&gt;quit;&lt;/P&gt;&lt;P&gt;I am facing a couple of problems here.&amp;nbsp; LASSO is selecting the optimal model as one with the lowest ASE, which is always the last step(in case the 50th step) So, I need to increase the LASSOSTEPS=100, still it selects the 100th step as the optimal model It keeps on doing that how much ever I increase the LASSOSTEPS. And I increase the LASSOSTEPS, the number of variables selected into the model also increases and at around TEPS -80, the SAS selects all the 45 variables in to the optimal mode; Do, basically its not doing any more variable selection by the time&amp;nbsp; it reaches step 80, but still&amp;nbsp; last step (say 100th) is selected as the optimal model by LASSO.&amp;nbsp; Does anyone know why is it doing that? Also, the number of variable sthat SAS selects varies by the value of LAMBDARHO and LAMDASTEPS. Is there a way for me to what is the optimal values for these and how to get that for my model? This is my reference-&lt;A href="https://www.mwsug.org/proceedings/2017/AA/MWSUG-2017-AA02.pdf" target="_blank"&gt;https://www.mwsug.org/proceedings/2017/AA/MWSUG-2017-AA02.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I also use SAS documentation for HPGENSELECT as my reference. Can anyone help with this? Thank you.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Sep 2024 14:07:33 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/HPGENSELECT-and-LASSO/m-p/945051#M47228</guid>
      <dc:creator>cHirap</dc:creator>
      <dc:date>2024-09-24T14:07:33Z</dc:date>
    </item>
  </channel>
</rss>

