BookmarkSubscribeRSS Feed
xtc283x
Quartz | Level 8

Besides Proc GLM, which SAS statistical or regression modeling procedures allow the specification of multiple dependent variables?

31 REPLIES 31
xtc283x
Quartz | Level 8

Arthur-

Thanks. Somehow, the link you provided exploded in a window full of HTML.

art297
Opal | Level 21

I'll try and repost it here.  Remove the two embedded spaces from the following and copy and paste the url to your browser: www.ats.ucla.edu /stat/ sas/whatstat/

xtc283x
Quartz | Level 8

Arthur--Thanks, I always forget about canonical correlation. That's a good cross-software comparison, focused on OLS. Tom

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

Different possibilities. What is your goal? Simultaneous modeling of all the (possibly correlated) response variables or just having a convenient way of getting separate analyses for several response variables? Check out PROC PLS (partial least squares).

xtc283x
Quartz | Level 8

LVM-

Simultaneous modeling...as in MANOVA but extended to approaches such as hierarchical mixture models (e.g., Proc Mixed with multiple DVs which this procedure won't allow), maximum likelihood estimation for mixtures of DV forms, and so on. Proc PLS is kind of the idea but I prefer to obtain estimates that aren't collapsed across a PLS-derived component.

Make sense?

Thanks,

Tom

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

Lots of ways of doing this with proc mixed and glimmix for hierarchical models. This old article is technically about repeated measures in GLM and MIXED, but is ultimately a comparison of MANOVA in the two procedures.

https://support.sas.com/rnd/app/stat/papers/mixedglm.pdf

Here is a more recent blog on the topic:

http://blogs.sas.com/content/sastraining/2011/02/02/the-punchline-manova-or-a-mixed-model/

The syntax is different, and the data are stacked, for analysis in MIXED and GLIMMIX compared with GLM, but you can handle so many more situations with the mixed-model procedures. You are not even restricted to the same distribution. One of the nice examples in the User's Guide for GLIMMIX deals with a multivariate analysis of two random variables, one binary (Bernoulli) and the other count (Poisson).

SAS/STAT(R) 9.3 User's Guide

xtc283x
Quartz | Level 8

LVM-Interesting. The different data structures -- e.g., stacking up the "Y" variable and adding an "Age" predictor in Proc Mixed vs using "Y1-Y4" with a REPEATED statement in Proc GLM isn't quite what I have in mind. Proc Mixed (or Glimmix) still only permit a single "Y" or dependent variable.  Tom

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

No, you are incorrect. MIXED and GLIMMIX can absolutely handle multiple dependent variables. This is exactly what is demonstrated in the links I sent.

SteveDenham
Jade | Level 19

Light bulb goes on.  It all comes down to identifying a proper SUBJECT=.  Then with an unstructured or factor analytic covariance matrix you can literally have your cake (multivariate) and eat it too (hierarchical), and all you need is a proper machine capable of fitting the model.

Steve Denham

xtc283x
Quartz | Level 8

Steve-

  In thinking more about your SUBJECT= suggestion, it's not clear to me how the SUBJECT= option differs from the GROUP= option in the RANDOM statement. In other words, if one is modeling multiple dependent variables, would one approach be to treat the data stacks for the different DVs using the GROUP= option and the actual unit of analysis with the SUBJECT= option?

  Some clarification of this would be helpful!

Thanks,

Thomas

SteveDenham
Jade | Level 19

Thomas,

Larry has done a lot more in this area than I have, but I would try to avoid the use of the GROUP= option in a multivariate analysis because of the potential blow-up of the number of parameters to be estimated.  The Joint Modeling of Binary and Count Data example in the PROC GLIMMIX documentation is what I have been working off of, where 'dist' or 'parm' or something else is a fixed effect, modeled as a repeated measurement on an individual subject.  I am having a hard time figuring out how to interpret a GROUP='dist' under this approach.  Now if I understand Walt Stroup's book correctly, and model these as Zg inside the linear predictor, then maybe I can wrap my head around a GROUP= approach.  But I'm old, and that hurts my brain too much.

If someone has a worked example I could look at, that would help.

Steve Denham

lvm
Rhodochrosite | Level 12 lvm
Rhodochrosite | Level 12

As stated by Steve, group= would not be appropriate for your problem. With GROUP, you get separate parameter estimates (or random effect prediction) for each group. The groups are treated independently. Your multivariate responses are correlated. So, you should use subject=, the syntax for correlated data. For more advanced modeling, you can combine group and subject, but I won't get into that here.

xtc283x
Quartz | Level 8

Steve and lvm-

As always, thank you for your insightful comments.

Best,

Thomas

SAS Innovate 2025: Save the Date

 SAS Innovate 2025 is scheduled for May 6-9 in Orlando, FL. Sign up to be first to learn about the agenda and registration!

Save the date!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 31 replies
  • 4471 views
  • 7 likes
  • 6 in conversation