<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Logistic regression or GLM in Statistical Procedures</title>
    <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129377#M6799</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;I am new to logistic and GLM procedures, and therefore I have some syntactical and conceptual questions:&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;I have a dataset(attached to this post) which has information about the salary and various other important characteristics of all faculty (n=52) in a college.&amp;nbsp; The descriptions of the variables are as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;OBS: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;observation #&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;SX: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;sex (0=Male, 1=Female)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;RK: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;rank (1=Assistant Professor, 2=Associate Professor, 3=Full Professor)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;YR: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;# years in current rank&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;DG: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;highest degree (0=Masters, 1=Doctorate)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;YD: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;# years since highest degree earned&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;SL: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;academic year salary ($)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif;"&gt;I need to determine if gender is associated with rank, highest degree, number of years in current rank, number of years since highest degree earned, and academic year salary. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;Since my gender is a binary outcome, I have used logistic regression to address the question. However I am getting a result where all my predictors seem highly significant which does not look to be correct. Am I approaching this question correctly or is my syntax not correct? Should I be using GLM?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;My code is as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc logistic data=discrimination; &lt;/P&gt;&lt;P&gt;freq yd;&lt;/P&gt;&lt;P&gt;freq yr;&lt;/P&gt;&lt;P&gt;class rk dg;&lt;/P&gt;&lt;P&gt;model sx(descending) =rk yr dg yd sl;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Another question that I am addressing is:&lt;/P&gt;&lt;P&gt;2. Is there a significant relationship between rank and academic year salary?&lt;/P&gt;&lt;P&gt;I am using a simple regression model. Here I have assigned rank as X (categorical) and salary as Y(continuous). Am I doing this correctly?&lt;/P&gt;&lt;P&gt;Below is the code:&lt;/P&gt;&lt;P&gt;proc reg data=discrimination SIMPLE;&lt;/P&gt;&lt;P&gt; model SL = rk;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Sun, 18 Nov 2012 04:50:43 GMT</pubDate>
    <dc:creator>Biobee</dc:creator>
    <dc:date>2012-11-18T04:50:43Z</dc:date>
    <item>
      <title>Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129377#M6799</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;I am new to logistic and GLM procedures, and therefore I have some syntactical and conceptual questions:&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;I have a dataset(attached to this post) which has information about the salary and various other important characteristics of all faculty (n=52) in a college.&amp;nbsp; The descriptions of the variables are as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;OBS: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;observation #&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;SX: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;sex (0=Male, 1=Female)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;RK: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;rank (1=Assistant Professor, 2=Associate Professor, 3=Full Professor)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;YR: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;# years in current rank&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;DG: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;highest degree (0=Masters, 1=Doctorate)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;YD: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;# years since highest degree earned&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;SL: &lt;/SPAN&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;academic year salary ($)&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif;"&gt;I need to determine if gender is associated with rank, highest degree, number of years in current rank, number of years since highest degree earned, and academic year salary. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;Since my gender is a binary outcome, I have used logistic regression to address the question. However I am getting a result where all my predictors seem highly significant which does not look to be correct. Am I approaching this question correctly or is my syntax not correct? Should I be using GLM?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: 'Calibri','sans-serif';"&gt;My code is as follows:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;proc logistic data=discrimination; &lt;/P&gt;&lt;P&gt;freq yd;&lt;/P&gt;&lt;P&gt;freq yr;&lt;/P&gt;&lt;P&gt;class rk dg;&lt;/P&gt;&lt;P&gt;model sx(descending) =rk yr dg yd sl;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Another question that I am addressing is:&lt;/P&gt;&lt;P&gt;2. Is there a significant relationship between rank and academic year salary?&lt;/P&gt;&lt;P&gt;I am using a simple regression model. Here I have assigned rank as X (categorical) and salary as Y(continuous). Am I doing this correctly?&lt;/P&gt;&lt;P&gt;Below is the code:&lt;/P&gt;&lt;P&gt;proc reg data=discrimination SIMPLE;&lt;/P&gt;&lt;P&gt; model SL = rk;&lt;/P&gt;&lt;P&gt;run;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 18 Nov 2012 04:50:43 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129377#M6799</guid>
      <dc:creator>Biobee</dc:creator>
      <dc:date>2012-11-18T04:50:43Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129378#M6800</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;A few initial remarks:&lt;/P&gt;&lt;P&gt;Your use of the freq statement is incorrect.&amp;nbsp; You would only use it if you had n identical instances which were represented in one obsevation of your data, with a frequency of n.&lt;/P&gt;&lt;P&gt;Modelling sex as if it were a dependent attribute is a bit perverse.&amp;nbsp; I would expect you to model rank or salary on some set of the other indicators.&amp;nbsp; To use logistic on salary you would have to segment the data, perhaps as low, mid or high.&lt;/P&gt;&lt;P&gt;I'll leave the finer points to others.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Richard in Oz&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 18 Nov 2012 06:44:54 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129378#M6800</guid>
      <dc:creator>RichardinOz</dc:creator>
      <dc:date>2012-11-18T06:44:54Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129379#M6801</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Richard&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for replying to my post and your suggestion for not using freq. With regards to using gender as the outcome variable, I agree with your point of view, however I cannot change what the question requires. So will have to work with gender as outcome.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 18 Nov 2012 17:01:12 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129379#M6801</guid>
      <dc:creator>Biobee</dc:creator>
      <dc:date>2012-11-18T17:01:12Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129380#M6802</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You state your question as :&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;I need to determine if gender is associated with rank, highest degree, number of years in current rank, number of years since highest degree earned, and academic year salary.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;What did your univariate comparison say for each variable before multivariate model?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;Second of all, I'm with RichardinOz, your outcome shouldn't be gender, that is an dependent variable the outcome is something else.&amp;nbsp; &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="font-family: Calibri, sans-serif; background-color: #ffffff;"&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN style="background-color: #ffffff; font-family: Calibri, sans-serif;"&gt;Association&lt;/SPAN&gt;&lt;SPAN style="background-color: #ffffff; font-family: Calibri, sans-serif;"&gt; doesn't have to mean the variable is the independent variable.&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sun, 18 Nov 2012 19:46:15 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129380#M6802</guid>
      <dc:creator>Reeza</dc:creator>
      <dc:date>2012-11-18T19:46:15Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic regression or GLM</title>
      <link>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129381#M6803</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You say "I cannot change what the question requires. So will have to work with gender as outcome."&lt;/P&gt;&lt;P&gt;I disagree.&amp;nbsp; Having gender as the outcome implies you have a population which undergoes sex change as it progresses through rank, academic outcome and salary. &lt;/P&gt;&lt;P&gt;As an analyst you have a responsibility to challenge incorrect assumptions.&amp;nbsp; Otherwise you are little better than a 'script kiddie' throwing code at data in the hope that something sticks.&amp;nbsp; Who is asking the question?&amp;nbsp; Go back to them and get them to restate the problem.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Richard in Oz&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 19 Nov 2012 00:15:26 GMT</pubDate>
      <guid>https://communities.sas.com/t5/Statistical-Procedures/Logistic-regression-or-GLM/m-p/129381#M6803</guid>
      <dc:creator>RichardinOz</dc:creator>
      <dc:date>2012-11-19T00:15:26Z</dc:date>
    </item>
  </channel>
</rss>

