BookmarkSubscribeRSS Feed
JerryChang__
Calcite | Level 5

I have some questions I'd like to ask the experts.

1. I read some articles about methods for handling longitude data, like The SEM approach to longitude data analysis using Calis procedure by An and Yung, but they didn't use path analysis. If I want to use path analysis to analyze longitude data, how should I go about it?"

2.Additionally, my variables include both binary and continuous types. How should I handle this? I find some way that use METHOD=ML to deal with it, is that correct? Basically, I think the outcome should be a little bit like Logistic Regression.

 

Bottom part is my code, but this is not longitude data, and I have no idea that this way can deal with binary variables (TNT and HF is binary).

For short the parts, I delete some similar code.

proc calis covariance outmodel=_eqparms_ method = MLM ;
lineqs
TNT = pTNTDM DM +pTNTLVEF LVEF+e_TNT,
HF=pHFTNT TNT +pHFbmi bmi+pHFDM DM+e_HF;

var TnT tempme bmi dm age lvef egfr msbp edvi_di HF;
run;
proc print data=_eqparms_;
run;

 
4 REPLIES 4
jiltao
SAS Super FREQ

PROC CALIS cannot do link functions, so no logistic type of model is possible in PROC CALIS.

For binary endogenous variables, you might find this usage note helpful --

http://support.sas.com/kb/22529

For path analysis for longitudinal data with continuous response, you might find this example helpful --

https://go.documentation.sas.com/doc/en/pgmsascdc/9.4_3.4/statug/statug_calis_examples61.htm

Hope this helps,

Jill

 

JerryChang__
Calcite | Level 5

Hi Jill,

Thanks for your feedback, I found the fact at the same place that is why I originally use METHOD=MLM to cope with the binary outcome, so I think that is fine.

 

But the second one is about latent factors about latent curve model, in fact, I want to find variables effect other variables, and find out their influence.

For example, like the image, x1 effect x3, x3 effect x4, x2 effect x4, x1 effect x2 each other's. And x1-x4 individually, have four times repeated measure data. Does the latent curve model have the same function?

FEDE65FC-97B2-40A1-82D3-282228130D99.jpg

Jerry 

Best Wish

jiltao
SAS Super FREQ

Jerry,

You could write your model for each time point. Variables are denoted by _t1,... _t4 for each of them.

 

Below is some information I provided previously for a user's question. The path is not the same as yours but the basic idea should be the same.

 

You might specify the following model for time point 1 --
LINEQS  
   V1_t1  =  b12 * V2_t1  + b13 * V3_t1  + E1,
   V2_t1  =  b24 * V4_t1  + b25 * V5_t1  + b26 * V6_t1 + b27 * V7_t1 + E2,
   V3_t1  =  b34 * V4_t1  + b35 * V5_t1  + b36 * V6_t1 + b37 * V7_t1 + E3;

 

Now, suppose you have two time points, the problem becomes more complicated. Suppose that initially, you simply stack the time_2 variable models to the previous specifications:


LINEQS

   V1_t1  =  b12 * V2_t1  + b13 * V3_t1  + E1,
   V2_t1  =  b24 * V4_t1  + b25 * V5_t1  + b26 * V6_t1 + b27 * V7_t1  + E2,
   V3_t1  =  b34 * V4_t1  + b35 * V5_t1  + b36 * V6_t1 + b37 * V7_t1  + E3

   V1_t2  =  b12_t2 * V2_t2  + b13_t2 * V3_t2  + E4,
   V2_t2  =  b24_t2 * V4_t2  + b25_t2 * V5_t2  + b26_t2 * V6_t2 + b27_t2 * V7_t2 + E5,
   V3_t2  =  b34_t2 * V4_t2  + b35_t2 * V5_t2  + b36_t2 * V6_t2 + b37_t2 * V7_t2 + + E6;

Here you also add a new set of coefficients with suffix “_t2”. But this is not necessary if your theory would hypothesize b’s is invariant across time, that is b=b_t2 for all b’s.  

Then, you can ask that wouldn’t it make sense to assume V1_t2 is predictable by v1_t1? Sure, then you modify the fourth equation by adding an effect:
   V1_t2  =  b12_t2 * V2_t2  + b13_t2 * V3_t2  +  a11_t21 * V1_t1  + E4,
But can V1_t2 be affected by V2_t1 too? If so, you could do:
    V1_t2  =  b12_t2 * V2_t2  + b13_t2 * V3_t2  +  a11_t21 * V1_t1 + a12_t21 *V2_t1  + E4,
 

Then you can keep asking about the effects of other t1 variables on V1_t2. You might expand the above equation by including non-negligible effects---but you must make your own judgment about what effects are negligible or not. Then, the next consideration is about the modification of equations for V2_t2 and V3_t2---basically, you are trying to specify how variables in different time periods could relate to each other. You might also add equations for other t2 variables like:
V4_t2 = * V4_t1 + … + E7

Yes, this will become very tedious and difficult to manage! But this is the precisely nature of using a very general modeling method to study repeated multivariate measures.
 

CALIS or SEM is a flexible modeling tool that you cannot expect one to tell you a generic model to fit. For longitudinal data, there are quite many possibilities.  Questions that might guide you to specify the model are:
 1. Can I assume some effects to be invariant across time points (e.g, b12 = b12_t1 = b12_t2)?

2. Are variables at time point 2 affected by variables at time point 1?
3. Can my research questions be answered by simpler statistical techniques? SEM is not the best modeling techniques in all occasions.

 

I recommend you to try out some models with fewer time points (say, the first,  and the last) and see if you can understand the modeling and interpret the results. Then you can build more complicated models.

 

Hope this helps,

Jill Tao

JerryChang__
Calcite | Level 5

Hi Jill,

That's really helpful for me. I will try it again. 

Jerry,

Best wishes

SAS Innovate 2025: Call for Content

Are you ready for the spotlight? We're accepting content ideas for SAS Innovate 2025 to be held May 6-9 in Orlando, FL. The call is open until September 25. Read more here about why you should contribute and what is in it for you!

Submit your idea!

What is ANOVA?

ANOVA, or Analysis Of Variance, is used to compare the averages or means of two or more populations to better understand how they differ. Watch this tutorial for more.

Find more tutorials on the SAS Users YouTube channel.

Discussion stats
  • 4 replies
  • 1036 views
  • 4 likes
  • 2 in conversation