<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Logistic Regression using CATMOD in SAS Procedures</title>
    <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11041#M1166</link>
    <description>Dear all, I'm having some trouble using CATMOD instead of PROC LOGISTIC. Normally I would use PROC LOGISTIC, but it appears to crash when the number of variables grows too large, whereas CATMOD remains stable and fast. So I must use CATMOD instead.&lt;BR /&gt;
&lt;BR /&gt;
Now I have the following code:&lt;BR /&gt;
---&lt;BR /&gt;
&lt;BR /&gt;
PROC LOGISTIC DATA=My_Data PLOTS(ONLY)=ALL;&lt;BR /&gt;
	CLASS GNDR (PARAM=EFFECT) BLGETMG (PARAM=EFFECT) EDULVL (PARAM=EFFECT) EDULVLM (PARAM=EFFECT) EDULVLF (PARAM=EFFECT) EDCTN (PARAM=EFFECT) Head_Unemployed (PARAM=EFFECT) DSBLD (PARAM=EFFECT) RTRD (PARAM=EFFECT) Number_Of_Children_SEC (PARAM=EFFECT) HHMMB (PARAM=EFFECT);&lt;BR /&gt;
	WEIGHT Weight_Household_2009;&lt;BR /&gt;
	MODEL In_Poverty (Event = '1') = GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB&lt;BR /&gt;
		/ LINK=LOGIT ALPHA=0.10;&lt;BR /&gt;
RUN;&lt;BR /&gt;
&lt;BR /&gt;
proc catmod data=My_Data;&lt;BR /&gt;
  response clogits;&lt;BR /&gt;
  model In_Poverty (Event = '1') = GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB / alpha=0.10;&lt;BR /&gt;
  weight Weight_Household_2009;&lt;BR /&gt;
run;&lt;BR /&gt;
---&lt;BR /&gt;
&lt;BR /&gt;
BUT running these two logistic regressions gives different estimates for my parameters, and I don't understand why. The variables are exactly the same, and the link functions should match. Still, they give me different results. Why?&lt;BR /&gt;
&lt;BR /&gt;
- Julian.</description>
    <pubDate>Wed, 16 Feb 2011 12:03:31 GMT</pubDate>
    <dc:creator>deleted_user</dc:creator>
    <dc:date>2011-02-16T12:03:31Z</dc:date>
    <item>
      <title>Logistic Regression using CATMOD</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11041#M1166</link>
      <description>Dear all, I'm having some trouble using CATMOD instead of PROC LOGISTIC. Normally I would use PROC LOGISTIC, but it appears to crash when the number of variables grows too large, whereas CATMOD remains stable and fast. So I must use CATMOD instead.&lt;BR /&gt;
&lt;BR /&gt;
Now I have the following code:&lt;BR /&gt;
---&lt;BR /&gt;
&lt;BR /&gt;
PROC LOGISTIC DATA=My_Data PLOTS(ONLY)=ALL;&lt;BR /&gt;
	CLASS GNDR (PARAM=EFFECT) BLGETMG (PARAM=EFFECT) EDULVL (PARAM=EFFECT) EDULVLM (PARAM=EFFECT) EDULVLF (PARAM=EFFECT) EDCTN (PARAM=EFFECT) Head_Unemployed (PARAM=EFFECT) DSBLD (PARAM=EFFECT) RTRD (PARAM=EFFECT) Number_Of_Children_SEC (PARAM=EFFECT) HHMMB (PARAM=EFFECT);&lt;BR /&gt;
	WEIGHT Weight_Household_2009;&lt;BR /&gt;
	MODEL In_Poverty (Event = '1') = GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB&lt;BR /&gt;
		/ LINK=LOGIT ALPHA=0.10;&lt;BR /&gt;
RUN;&lt;BR /&gt;
&lt;BR /&gt;
proc catmod data=My_Data;&lt;BR /&gt;
  response clogits;&lt;BR /&gt;
  model In_Poverty (Event = '1') = GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB / alpha=0.10;&lt;BR /&gt;
  weight Weight_Household_2009;&lt;BR /&gt;
run;&lt;BR /&gt;
---&lt;BR /&gt;
&lt;BR /&gt;
BUT running these two logistic regressions gives different estimates for my parameters, and I don't understand why. The variables are exactly the same, and the link functions should match. Still, they give me different results. Why?&lt;BR /&gt;
&lt;BR /&gt;
- Julian.</description>
      <pubDate>Wed, 16 Feb 2011 12:03:31 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11041#M1166</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2011-02-16T12:03:31Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic Regression using CATMOD</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11042#M1167</link>
      <description>Correction to the CATMOD code:&lt;BR /&gt;
---&lt;BR /&gt;
proc catmod data=My_Data;&lt;BR /&gt;
response clogits;&lt;BR /&gt;
model In_Poverty = GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB / alpha=0.10;&lt;BR /&gt;
weight Weight_Household_2009;&lt;BR /&gt;
run;&lt;BR /&gt;
---&lt;BR /&gt;
&lt;BR /&gt;
- Julian.</description>
      <pubDate>Wed, 16 Feb 2011 12:04:30 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11042#M1167</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2011-02-16T12:04:30Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic Regression using CATMOD</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11043#M1168</link>
      <description>This is a guess.  Repeat, this is only a guess. (Actually, three guesses).&lt;BR /&gt;
1.  PROC LOGISTIC and PROC CATMOD use very different parameterizations for class variables.  Could this be the source of your differences?&lt;BR /&gt;
2.  A second guess is that PROC LOGISTIC uses a maximum likelihood algorithm, whereas CATMOD uses weighted least squares.  That could also lead to differences.&lt;BR /&gt;
3. Finally, it may be that you need to specify the class variables in a DIRECT statement in CATMOD.  We are now beyond my experience level.&lt;BR /&gt;
&lt;BR /&gt;
I hope at least one of these leads you to some resolution.&lt;BR /&gt;
&lt;BR /&gt;
Steve Denham</description>
      <pubDate>Wed, 16 Feb 2011 12:51:51 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11043#M1168</guid>
      <dc:creator>SteveDenham</dc:creator>
      <dc:date>2011-02-16T12:51:51Z</dc:date>
    </item>
    <item>
      <title>Re: Logistic Regression using CATMOD</title>
      <link>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11044#M1169</link>
      <description>Thank you for the fast reply, Steve.&lt;BR /&gt;
&lt;BR /&gt;
However, I fixed the problem myself. It turns out that PROC LOGISTIC and PROC CATMOD are parameterized the same way in my code, both use maximum likelihood, and the DIRECT statement in CATMOD is only for continuous variables. BUT they optimize differently, and they treat "bad variables" (those whose estimates go to infinity, etc.) very differently. Once I removed all these bad variables, the estimates matched to very close numerical precision. It also appears that the numerical differences between CATMOD and LOGISTIC approach zero as the number of distinct cases for each class variable increases.&lt;BR /&gt;
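&lt;BR /&gt;
A minimal sketch of one way to look for such "bad variables" before fitting, assuming they come from class levels in which In_Poverty takes only one value (that cause is an assumption, not something established above). It cross-tabulates each class variable from the original model against the response; a level with a zero count in one of the In_Poverty columns is a candidate. The counts are unweighted.&lt;BR /&gt;
---&lt;BR /&gt;
/* Sketch only: unweighted cell counts of each class variable by response */&lt;BR /&gt;
proc freq data=My_Data;&lt;BR /&gt;
  tables (GNDR BLGETMG EDULVL EDULVLM EDULVLF EDCTN Head_Unemployed DSBLD RTRD Number_Of_Children_SEC HHMMB) * In_Poverty / norow nocol nopercent;&lt;BR /&gt;
run;&lt;BR /&gt;
---&lt;BR /&gt;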
&lt;BR /&gt;
I've posted my code again in case others run into the same problems:&lt;BR /&gt;
---&lt;BR /&gt;
PROC LOGISTIC DATA=SASUSER.FILTER_FOR_CON_DATA_FORMATT_0005 PLOTS(ONLY)=ALL;&lt;BR /&gt;
	CLASS GNDR (PARAM=EFFECT) Head_Unemployed (PARAM=EFFECT) Number_Of_Children_SEC (PARAM=EFFECT) HHMMB (PARAM=EFFECT);&lt;BR /&gt;
	WEIGHT Weight_Household_2009;&lt;BR /&gt;
	MODEL In_Poverty (Event = '1') = GNDR Head_Unemployed Number_Of_Children_SEC HHMMB&lt;BR /&gt;
		/ LINK=LOGIT ALPHA=0.10;&lt;BR /&gt;
RUN;&lt;BR /&gt;
&lt;BR /&gt;
proc catmod data=SASUSER.FILTER_FOR_CON_DATA_FORMATT_0005;&lt;BR /&gt;
  response clogits;&lt;BR /&gt;
  model In_Poverty = GNDR Head_Unemployed Number_Of_Children_SEC HHMMB / alpha=0.10;&lt;BR /&gt;
  weight Weight_Household_2009;&lt;BR /&gt;
run;&lt;BR /&gt;
---</description>
      <pubDate>Wed, 16 Feb 2011 17:15:44 GMT</pubDate>
      <guid>https://communities.sas.com/t5/SAS-Procedures/Logistic-Regression-using-CATMOD/m-p/11044#M1169</guid>
      <dc:creator>deleted_user</dc:creator>
      <dc:date>2011-02-16T17:15:44Z</dc:date>
    </item>
  </channel>
</rss>

