<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Logistic regression or GLM in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129465#M35299</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Actually I am also rookie of statistical theory. But I don't understand why you want use yd ,yr to be FREQ ? That couldn't be . And your code of logistic mode is also not look good, Did you check it more in the documentation ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc reg only be used to sequential data not categorical data ,therefore i think it is not a good idea .or you should try to use proc glm .&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Ksharp&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Mon, 19 Nov 2012 02:42:35 GMT</pubDate>
    <dc:creator>Ksharp</dc:creator>
    <dc:date>2012-11-19T02:42:35Z</dc:date>
    <item>
      <title>Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129464#M35298</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;I am new to logistic and GLM procedures, and therefore I have some syntactical and conceptual questions:&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;I have a dataset(attached to this post) which has information about the salary and various other important characteristics of all faculty (n=52) in a college.&amp;nbsp; The descriptions of the variables are as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;OBS: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;observation #&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;SX: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;sex (0=Male, 1=Female)&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;RK: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;rank (1=Assistant Professor, 2=Associate Professor, 3=Full Professor)&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;YR: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;# years in current rank&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;DG: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;highest degree (0=Masters, 1=Doctorate)&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;YD: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;# years since highest degree earned&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;SL: &lt;/SPAN&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;academic year salary ($)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;I need to determine if gender is associated with rank, highest degree, number of years in current rank, number of years since highest degree earned, and academic year salary.&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;Since my gender is a binary outcome, I have used logistic regression to address the question. However I am getting a result where all my predictors seem highly significant which does not look to be correct. Am I approaching this question correctly or is my syntax not correct? Should I be using GLM?&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;SPAN style="font-style: inherit; font-family: Calibri, sans-serif;"&gt;My code is as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;proc logistic data=discrimination;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;freq yd;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;freq yr;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;class rk dg;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;model sx(descending) =rk yr dg yd sl;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Another question that I am addressing is:&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;2. Is there a significant relationship between rank and academic year salary?&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;I am using a simple regression model. Here I have assigned rank as X (categorical) and salary as Y(continuous). Am I doing this correctly?&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Below is the code:&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;proc reg data=discrimination SIMPLE;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;model SL = rk;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;run;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;&lt;/P&gt;&lt;P style="font-family: 'Helvetica Neue', Helvetica, Arial, 'Lucida Grande', sans-serif; background-color: #ffffff;"&gt;Thanks in advance for your suggestions!&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 18 Nov 2012 16:58:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129464#M35298</guid>
      <dc:creator>Biobee</dc:creator>
      <dc:date>2012-11-18T16:58:35Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129465#M35299</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Actually I am also rookie of statistical theory. But I don't understand why you want use yd ,yr to be FREQ ? That couldn't be . And your code of logistic mode is also not look good, Did you check it more in the documentation ?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc reg only be used to sequential data not categorical data ,therefore i think it is not a good idea .or you should try to use proc glm .&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Ksharp&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 Nov 2012 02:42:35 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129465#M35299</guid>
      <dc:creator>Ksharp</dc:creator>
      <dc:date>2012-11-19T02:42:35Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129466#M35300</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;While this method may work (in the sense that you get a solution), I think you might have reversed the roles of independent and dependent variable, based on your statement "I need to determine if gender is associated with rank, etc.".&amp;nbsp; I would think that you might want to just know if the average rank, number of years, etc. differ for males and females.&amp;nbsp; Thus, for the ordinal responses (rank and highest degree), PROC FREQ would probably be the straightforward analysis.&amp;nbsp; For the interval responses (YR, YD, YL) as the dependent variable, I would start with PROC GLM, but pay particular attention to the distribution of the residuals.&amp;nbsp; If the residuals deviate a lot from normality (and I would use QQ plots to determine this rather than normality tests), I would move to a procedure that could capture the distribution, such as PROC GENMOD or PROC GLIMMIX.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Steve Denham&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 Nov 2012 12:37:50 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129466#M35300</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2012-11-19T12:37:50Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129467#M35301</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Biobee,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In addition to Steve's comments, I would also caution that your sample size is extremely small (N=52) so you are unlikely to be able to do more than univariable analyses.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;[The reason that everything was significant in your initial PROC LOGISTIC is the FREQ statements.&amp;nbsp; The FREQ statement treats those variables as observation multipliers, so you effective sample size became many thousands instead of 52.]&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Doc Muhlbaier&lt;/P&gt;&lt;P&gt;Duke&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 Nov 2012 14:21:11 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-regression-or-GLM/m-p/129467#M35301</guid>
      <dc:creator>Doc_Duke</dc:creator>
      <dc:date>2012-11-19T14:21:11Z</dc:date>
    </item>
  </channel>
</rss>

