<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Clarification of statement on variables screening in SAS Academy for Data Science</title>
    <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarification-of-statement-on-variables-screening/m-p/652300#M855</link>
    <description>&lt;P&gt;If you plan to use the best subset selection method in PROC LOGISTIC, you need to get the number of predictor variables down to around 50 or else the CPU time will be fairly excessive. Therefore, if the number of clusters obtained by PROC VARCLUS is much greater than 50, then further screening methods such as the Spearman and Hoeffding correlation statistics could be used to further reduce the number of predictor variables available to PROC LOGISTIC. Very liberal univariate screening simply means reducing the number of variables down to a reasonable number.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 01 Jun 2020 17:05:05 GMT</pubDate>
    <dc:creator>sasmlp</dc:creator>
    <dc:date>2020-06-01T17:05:05Z</dc:date>
    <item>
      <title>Clarification of statement on variables screening</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarification-of-statement-on-variables-screening/m-p/651946#M833</link>
      <description>&lt;P&gt;Re: Predictive Modeling Using Logistic Regression&lt;/P&gt;
&lt;P&gt;Just to confirm my understanding of the following statement at the bottom of page 3.47 of the course text: "Very liberal univariate screening might be helpful when the number of clusters created in PROC VARCLUS is still relatively large".&lt;/P&gt;
&lt;P&gt;Does "liberal univariate screening" mean that it is better to err on the side of allowing more inputs through the screening and then rely on regression selection techniques to find the best predictors?&lt;/P&gt;</description>
      <pubDate>Sat, 30 May 2020 08:54:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarification-of-statement-on-variables-screening/m-p/651946#M833</guid>
      <dc:creator>pvareschi</dc:creator>
      <dc:date>2020-05-30T08:54:54Z</dc:date>
    </item>
    <item>
      <title>Re: Clarification of statement on variables screening</title>
      <link>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarification-of-statement-on-variables-screening/m-p/652300#M855</link>
      <description>&lt;P&gt;If you plan to use the best subset selection method in PROC LOGISTIC, you need to get the number of predictor variables down to around 50 or else the CPU time will be fairly excessive. Therefore, if the number of clusters obtained by PROC VARCLUS is much greater than 50, then further screening methods such as the Spearman and Hoeffding correlation statistics could be used to further reduce the number of predictor variables available to PROC LOGISTIC. Very liberal univariate screening simply means reducing the number of variables down to a reasonable number.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jun 2020 17:05:05 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Academy-for-Data-Science/Clarification-of-statement-on-variables-screening/m-p/652300#M855</guid>
      <dc:creator>sasmlp</dc:creator>
      <dc:date>2020-06-01T17:05:05Z</dc:date>
    </item>
  </channel>
</rss>

