<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912381#M45293</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;R-squared is not used for model selection in PROC QUANTREG (PROC QUANTSELECT).&lt;/P&gt;
&lt;P&gt;The model selection can be based on the minimization of the average check loss (ACL) computed from the validation data.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;correctly points out, you are not "optimizing" mean prediction (conditional mean of the response),&lt;/P&gt;
&lt;P&gt;but you are "optimizing" the fit of the&amp;nbsp;entire conditional distribution.&lt;BR /&gt;(Although quantile regression is most often used to model specific conditional quantiles of the response, its full potential&lt;BR /&gt;lies in modeling the entire conditional distribution.)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;SAS Global Forum 2017 -- Paper SAS525-2017&lt;BR /&gt;Five Things You Should Know about Quantile Regression&lt;BR /&gt;Robert N. Rodriguez and Yonggang Yao, SAS Institute Inc.&lt;BR /&gt;&lt;A href="https://support.sas.com/resources/papers/proceedings17/SAS0525-2017.pdf" target="_blank"&gt;https://support.sas.com/resources/papers/proceedings17/SAS0525-2017.pdf&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Fast Quantile Process Regression&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/Research-and-Science-from-SAS/Fast-Quantile-Process-Regression/ta-p/777965" target="_blank"&gt;https://communities.sas.com/t5/Research-and-Science-from-SAS/Fast-Quantile-Process-Regression/ta-p/777965&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
    <pubDate>Sun, 21 Jan 2024 13:51:09 GMT</pubDate>
    <dc:creator>sbxkoenk</dc:creator>
    <dc:date>2024-01-21T13:51:09Z</dc:date>
    <item>
      <title>Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912272#M45288</link>
      <description>&lt;P class=""&gt;Hi&lt;/P&gt;&lt;P class=""&gt;I am trying proc quantselect for the first time in SAS, with the following syntax:&lt;/P&gt;&lt;P class=""&gt;proc quantselect data=data;&lt;BR /&gt;class classvar1;&lt;BR /&gt;model y=scalevar1*classvar1 scalevar1 classvar1 / details=all selection=stepwise (select=sl slentry=0.05 slstay=0.1 choose=adjr1);&lt;BR /&gt;run;&lt;/P&gt;&lt;P class=""&gt;the model selected by proc quantselect is y=scalevar1*classvar1&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Now if I run glm testing the 4 different models, ie:&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Model 1 (selected through proc quantselect):&lt;/P&gt;&lt;P class=""&gt;proc glm data=data;class classvar1;model y=scalevar1*classvar1 / effectsize solution;run;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Model 2&lt;/P&gt;&lt;P class=""&gt;proc glm data=data;class classvar1;model y=scalevar1*classvar1 scalevar1 classvar1 / effectsize solution;run;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Model 3&lt;/P&gt;&lt;P class=""&gt;proc glm data=data;class classvar1;model y=scalevar1*classvar1 scalevar1 / effectsize solution;run;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Model 4&lt;/P&gt;&lt;P class=""&gt;proc glm data=data;class classvar1;model y=scalevar1 classvar1 / effectsize solution;run;&lt;/P&gt;&lt;P class=""&gt;&amp;nbsp;&lt;/P&gt;&lt;P class=""&gt;Then the R^2 value of model 1 (0.545252) is lower than that of Model 2 (0.570148). I am not sure about how then PROC QUANTSELECT selected model 1 over model 2. Could it be because quantselect doesn't use R^2? I based the model choice of the adjuster R for quantile regression, even though I am not sure what that is.&lt;/P&gt;&lt;P class=""&gt;Any explanations would be greatly appreciated&lt;/P&gt;&lt;P class=""&gt;Thanks!&lt;/P&gt;&lt;P class=""&gt;Neri&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2024 20:38:46 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912272#M45288</guid>
      <dc:creator>Neridhren</dc:creator>
      <dc:date>2024-01-19T20:38:46Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912379#M45292</link>
      <description>proc quantselect is based on MEDIAN,&lt;BR /&gt;whereas , proc glm/glmselect is based on MEAN, if you want to build a quantile regression, just use proc quantselect.</description>
      <pubDate>Sun, 21 Jan 2024 09:28:37 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912379#M45292</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2024-01-21T09:28:37Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912381#M45293</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;R-squared is not used for model selection in PROC QUANTREG (PROC QUANTSELECT).&lt;/P&gt;
&lt;P&gt;The model selection can be based on the minimization of the average check loss (ACL) computed from the validation data.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As&amp;nbsp;&lt;a href="https://communities.sas.com/t5/user/viewprofilepage/user-id/18408"&gt;@Ksharp&lt;/a&gt;&amp;nbsp;correctly points out, you are not "optimizing" mean prediction (conditional mean of the response),&lt;/P&gt;
&lt;P&gt;but you are "optimizing" the fit of the&amp;nbsp;entire conditional distribution.&lt;BR /&gt;(Although quantile regression is most often used to model specific conditional quantiles of the response, its full potential&lt;BR /&gt;lies in modeling the entire conditional distribution.)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;UL class="lia-list-style-type-square"&gt;
&lt;LI&gt;SAS Global Forum 2017 -- Paper SAS525-2017&lt;BR /&gt;Five Things You Should Know about Quantile Regression&lt;BR /&gt;Robert N. Rodriguez and Yonggang Yao, SAS Institute Inc.&lt;BR /&gt;&lt;A href="https://support.sas.com/resources/papers/proceedings17/SAS0525-2017.pdf" target="_blank"&gt;https://support.sas.com/resources/papers/proceedings17/SAS0525-2017.pdf&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Fast Quantile Process Regression&lt;BR /&gt;&lt;A href="https://communities.sas.com/t5/Research-and-Science-from-SAS/Fast-Quantile-Process-Regression/ta-p/777965" target="_blank"&gt;https://communities.sas.com/t5/Research-and-Science-from-SAS/Fast-Quantile-Process-Regression/ta-p/777965&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Koen&lt;/P&gt;</description>
      <pubDate>Sun, 21 Jan 2024 13:51:09 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912381#M45293</guid>
      <dc:creator>sbxkoenk</dc:creator>
      <dc:date>2024-01-21T13:51:09Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912566#M45300</link>
      <description>If means are appropriate, try glmselect. If you want to model the median or other quantile, then quantreg.</description>
      <pubDate>Mon, 22 Jan 2024 21:37:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/912566#M45300</guid>
      <dc:creator>gp4</dc:creator>
      <dc:date>2024-01-22T21:37:43Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913603#M45349</link>
      <description>&lt;P&gt;Plenty of good advice has already been given. I do want to point out something about R^2 that is happening when you run GLM on the different models. For a given dataset, the more independent terms you have in the model, the higher the R^2 value. I would have been really, really surprised if Model 1 had given you a larger R^2 than Model 2.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;SteveDenham&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2024 15:37:19 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913603#M45349</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2024-01-30T15:37:19Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913605#M45350</link>
      <description>&lt;P&gt;Hi Steve&lt;/P&gt;&lt;P&gt;Thanks for the answer, and would love it if you could go a little deeper into your comment. As suggested I repeated my analysis using glmselect, and again model1 is chosen over the rest, but model2 has a higher R^2.&amp;nbsp; So what you said is relevant, but I'd appreciate it if you could explain a bit more.&lt;/P&gt;&lt;P&gt;The other piece of information to add is that glm of model 1 gives a significant effect for scalevar1*classvar1, whereas glm of model 2 is only significant for the main effect of scalevar1.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;Neri&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2024 15:42:20 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913605#M45350</guid>
      <dc:creator>Neridhren</dc:creator>
      <dc:date>2024-01-30T15:42:20Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913607#M45351</link>
      <description>&lt;P&gt;Any introductory text on regression analysis will walk you through the algebra to prove that increasing the number of predictors will increase the R^2. See this YouTube video for a quick walk through&amp;nbsp;&lt;A href="https://www.youtube.com/watch?v=CGQpi580sZM" target="_self"&gt;https://www.youtube.com/watch?v=CGQpi580sZM&lt;/A&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The video goes on to talk about the adjusted R^2, which penalizes for the number of predictors.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When it comes to multiple regression and model selection, there is a lot of literature out there. It turns out that almost every algorithm for model selection has at least some drawback, but it is worse for stepwise and all possible subset methods.&amp;nbsp; Good luck.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;SteveDenham&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2024 15:56:06 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913607#M45351</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2024-01-30T15:56:06Z</dc:date>
    </item>
    <item>
      <title>Re: Model Selection: Quantselect results vs individual R^2 results obtained by glm of each model</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913613#M45352</link>
      <description>Awesome! Great resource. Thanks again&lt;BR /&gt;Neri</description>
      <pubDate>Tue, 30 Jan 2024 16:45:58 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Model-Selection-Quantselect-results-vs-individual-R-2-results/m-p/913613#M45352</guid>
      <dc:creator>Neridhren</dc:creator>
      <dc:date>2024-01-30T16:45:58Z</dc:date>
    </item>
  </channel>
</rss>

