BookmarkSubscribeRSS Feed
deleted_user
Not applicable
I ran PROC LOGISTIC , ( all of my predictors were categorical) my model being:
MODEL PASS (EVENT ='1') =

DEV_SMITH
DEV_JONES
DEV_WILL
GEN_JACKSON
GEN_DAVIS
GEN_CORNELL
;

In the output I got this message:

Note: The following parameters have been set to 0, since the variables are a linear combination of other variables as shown.

DEV_WILL = Intercept - DEV_SMITH - DEV_JONES
GEN_CORNELL = Intercept - GEN_JACKSON - GEN_DAVIS

I want to interpret this to mean that there was multicollinearity between the independent predictors and SAS discovered it and eliminated the variables as a result. i.e. 'fixing the problem'.

Is this correct, or do I have something else to worry about?
Thanks.
2 REPLIES 2
Doc_Duke
Rhodochrosite | Level 12
It goes beyond multicollinearity to an algebraic identity.
n6
Quartz | Level 8 n6
Quartz | Level 8
It means that since the values for GEN JACKSON and GEN DAVIS along with the Intercept completely determine the value of GEN CORNELL, having a term for GEN CORNELL in the model is redundant. GEN CORNELL will serve as the referent and the results for GEN JACKSON tells how GEN JACKSON differs from GEN CORNELL. If the results for GEN JACKSON are significant then it does not mean that GEN JACKSON is different from 0 but rather than GEN JACKSON is different from GEN CORNELL.

If you wanted to, instead of putting three variables for GEN in the model, you could have one variable named GEN (or whatever) that has three values (JACKSON, DAVIS and CORNELL). Then you could put that variable in a class statement and also in the model. You'd have

class gen;
model PASS (EVENT ='1') = gen;

If you wanted to use reference cell coding you could change the class statement to

class gen / param=ref;

I think this will automatically make the highest value of the variable be the referent, which in this case is GEN JACKSON (because JACKSON comes alphabetically later than DAVIS or CORNELL).

If instead you wanted CORNELL to be the referent you could use

class gen (ref=first) / param=ref;

And if you wanted the middle value of the three, DAVIS, to be the referent then there is a way to do that too but I forget right now. It's in the SAS documentation though.

sas-innovate-2024.png

Join us for SAS Innovate April 16-19 at the Aria in Las Vegas. Bring the team and save big with our group pricing for a limited time only.

Pre-conference courses and tutorials are filling up fast and are always a sellout. Register today to reserve your seat.

 

Register now!

What is Bayesian Analysis?

Learn the difference between classical and Bayesian statistical approaches and see a few PROC examples to perform Bayesian analysis in this video.

Find more tutorials on the SAS Users YouTube channel.

Click image to register for webinarClick image to register for webinar

Classroom Training Available!

Select SAS Training centers are offering in-person courses. View upcoming courses for:

View all other training opportunities.

Discussion stats
  • 2 replies
  • 5303 views
  • 0 likes
  • 3 in conversation